Nucleic acids that control endosperm development in plants

ABSTRACT

The invention provides methods of controlling endosperm development in plants.

CROSS REFERENCE TO RELATED APPLICATIONS

This is a continuation-in-part of U.S. Ser. No. 09/071,838 (pending),filed May 1, 1998, which is incorporated herein by reference.

This invention was made with Government support under Award No.95-37304-2329, awarded by the United States Department of Agricultureand Award Nos. 97-34339-4954 and Award No. 96-00242/1, awarded by theUnited States-Israel Binational Agricultural Research and DevelopmentFund. The Government has certain rights in this invention.

FIELD OF THE INVENTION

The present invention is directed to plant genetic engineering. Inparticular, it relates to modulation of expression of genes controllingendosperm development in plants.

BACKGROUND OF THE INVENTION

A fundamental problem in biology is to understand how fertilizationinitiates reproductive development. In higher plants, the ovulegenerates the female gametophyte which is composed of egg, central,synergid and antipodal cells (Reiser, et al., Plant Cell, 1291-1301(1993)). All are haploid except the central cell which contains twodaughter nuclei that fuse prior to fertilization. One sperm nucleusfertilizes the egg to form the zygote, whereas another sperm nucleusfuses with the diploid central cell nucleus to form the triploidendosperm nucleus (van Went, et al., Embryology of Angiosperms, pp.273-318 (1984)). The two fertilization products undergo distinctpatterns of development. In Arabidopsis, the embryo passes through aseries of stages that have been defined morphologically as preglobular,globular, heart, cotyledon and maturation (Goldberg, R. B., et al.,Science (1994) 266: 605-614; Mansfield, S. G., et al., Arabidopsis: AnAtlas of Morphology and Development, pp. 367-383 (1994)). The primaryendosperm nucleus undergoes a series of mitotic divisions to producenuclei that migrate into the expanding central cell (Mansfield, S. G.,et al., Arab Inf Serv 27: 53-64 (1990); Webb, M. C., et al., Planta 184:187-195 (1991)). Cytokinesis sequesters endosperm cytoplasm and nucleiinto discrete cells (Mansfield, S. G., et al., Arab Inf Serv 27: 65-72(1990)) that produce storage proteins, starch, and lipids which supportembryo growth (Lopes, M. A. et al., Plant Cell 5: 1383-1399 (1993)).Fertilization also activates development of the integument cell layersof the ovule that become the seed coat, and induces the ovary to growand form the fruit, or silique, in Arabidopsis.

Control of the expression of genes that control egg and central celldifferentiation, or those that activate reproductive development inresponse to fertilization is useful in the production of plants with arange of desired traits. These and other advantages are provided by thepresent application.

SUMMARY OF THE INVENTION

The present invention provides methods of modulating fruit and seeddevelopment and other traits in plants. The methods involve providing aplant comprising a recombinant expression cassette containing an FIEnucleic acid linked to a plant promoter.

In some embodiments, transcription of the FIE nucleic acid inhibitsexpression of an endogenous FIE gene or activity the encoded protein.This embodiment is particularly useful, for instance, making embryo-lessseed and parthenocarpic fruit. Alternatively, expression of the FIEnucleic acid may enhance expression of an endogenous FIE gene or FIEactivity

In the expression cassettes, the plant promoter may be a constitutivepromoter, for example, the CaMV 35S promoter. Alternatively, thepromoter may be a tissue-specific promoter. Examples of tissue specificexpression useful in the invention include ovule-specific orembryo-specific expression. For instance, the promoter sequence from theFIE genes disclosed here can be used to direct expression in relevantplant tissues.

The invention also provides seed or fruit produced by the methodsdescribed above. The seed or fruit of the invention comprise arecombinant expression cassette containing an FIE nucleic acid.

Definitions

The phrase “nucleic acid sequence” refers to a single or double-strandedpolymer of deoxyribonucleotide or ribonucleotide bases read from the 5′to the 3′ end. It includes chromosomal DNA, self-replicating plasmids,infectious polymers of DNA or RNA and DNA or RNA that performs aprimarily structural role.

A “promoter” is defined as an array of nucleic acid control sequencesthat direct transcription of an operably linked nucleic acid. As usedherein, a “plant promoter” is a promoter that functions in plants.Promoters include necessary nucleic acid sequences near the start siteof transcription, such as, in the case of a polymerase II type promoter,a TATA element. A promoter also optionally includes distal enhancer orrepressor elements, which can be located as much as several thousandbase pairs from the start site of transcription. A “constitutive”promoter is a promoter that is active under most environmental anddevelopmental conditions. An “inducible” promoter is a promoter that isactive under environmental or developmental regulation. The term“operably linked” refers to a functional linkage between a nucleic acidexpression control sequence (such as a promoter. or array oftranscription factor binding sites) and a second nucleic acid sequence,wherein the expression control sequence directs transcription of thenucleic acid corresponding to the second sequence.

The term “plant” includes whole plants, plant organs (e.g., leaves,stems, flowers, roots, etc.), seeds and plant cells and progeny of same.The class of plants which can be used in the method of the invention isgenerally as broad as the class of higher plants amenable totransformation techniques, including angiosperms (monocotyledonous anddicotyledonous plants), as well as gymnosperms. It includes plants of avariety of ploidy levels, including polyploid, diploid, haploid andhemizygous.

A polynucleotide sequence is “heterologous to” an organism or a secondpolynucleotide sequence if it originates from a foreign species, or, iffrom the same species, is modified from its original form. For example,a promoter operably linked to a heterologous coding sequence refers to acoding sequence from a species different from that from which thepromoter was derived, or, if from the same species, a coding sequencewhich is different from any naturally occurring allelic variants.

A polynucleotide “exogenous to” an individual plant is a polynucleotidewhich is introduced into the plant by any means other than by a sexualcross. Examples of means by which this can be accomplished are describedbelow, and include Agrobacterium-mediated transformation, biolisticmethods, electroporation, and the like. Such a plant containing theexogenous nucleic acid is referred to here as an R₁ generationtransgenic plant. Transgenic plants which arise from sexual cross or byselfing are descendants of such a plant.

A “FIE nucleic acid” or “FIE polynucleotide sequence” of the inventionis a subsequence or full length polynucleotide sequence of a gene whichencodes a polypeptide involved in control of reproductive developmentand which, when mutated, allows for aspects of fertilization independentreproductive development. In some embodiments, the polypeptides of theinvention have substantial sequence identity (as defined below) to apolycomb group gene of Drosophila. An exemplary nucleic acid of theinvention is the Arabidopsis FIE1 and FIE3 sequences disclosed below.FIE polynucleotides are defined by their ability to hybridize underdefined conditions to the exemplified nucleic acids or PCR productsderived from them. An FIE polynucleotide is typically at least about30-40 nucleotides to about 3000, usually less than about 5000nucleotides in length. The nucleic acids contain coding sequence of fromabout 100 to about 2000 nucleotides, often from about 500 to about 1700nucleotides in length.

FIE nucleic acids are a new class of plant regulatory genes that encodepolypeptides with sequence identity to members of the polycomb groupgenes first identified in Drosophila. Polycomb group gene products andtheir homologues in other species are responsible for repression ofhomeotic genes. The proteins are a heterogenous group that interact witheach other to form large complexes that bind DNA and thereby controlgene expression. For a review of the current understanding of polycombcomplex genes see, Pirrotta Cur. Op. Genet. Dev. 7:249-258 (1997). Ninegroups of polycomb genes have been identified. FIE1 (SEQ ID NO:1) isrelated to the group of polycomb genes encoding protein comprising a SETdomain (see, e.g., Jenuwein et al. Cell. Mol. Life Sci. 54:80-93 (1998).FIE3 (SEQ ID NO:3) is related to the group encoding proteins comprisingWD40 repeats (see, Gutjahr et al. EMBO J. 14:4296-4306 (1995).

In the case of both expression of transgenes and inhibition ofendogenous genes (e.g., by antisense, or sense suppression) one of skillwill recognize that the inserted polynucleotide sequence need not beidentical, but may be only “substantially identical” to a sequence ofthe gene from which it was derived. As explained below, thesesubstantially identical variants are specifically covered by the termFIE nucleic acid.

In the case where the inserted polynucleotide sequence is transcribedand translated to produce a functional polypeptide, one of skill willrecognize that because of codon degeneracy a number of polynucleotidesequences will encode the same polypeptide. These variants arespecifically covered by the terms “FIE nucleic acid”. In addition, theterm specifically includes those sequences substantially identical(determined as described below) with an FIE polynucleotide sequencedisclosed here and that encode polypeptides that are either mutants ofwild type FIE polypeptides or retain the function of the FIE polypeptide(e.g., resulting from conservative substitutions of amino acids in theFIE polypeptide). In addition, variants can be those that encodedominant negative mutants as described below.

Two nucleic acid sequences or polypeptides are said to be “identical” ifthe sequence of nucleotides or amino acid residues, respectively, in thetwo sequences is the same when aligned for maximum correspondence asdescribed below. The terms “identical” or percent “identity,” in thecontext of two or more nucleic acids or polypeptide sequences, refer totwo or more sequences or subsequences that are the same or have aspecified percentage of amino acid residues or nucleotides that are thesame, when compared and aligned for maximum correspondence over acomparison window, as measured using one of the following sequencecomparison algorithms or by manual alignment and visual inspection. Whenpercentage of sequence identity is used in reference to proteins orpeptides, it is recognized that residue positions that are not identicaloften differ by conservative amino acid substitutions, where amino acidsresidues are substituted for other amino acid residues with similarchemical properties (e.g., charge or hydrophobicity) and therefore donot change the functional properties of the molecule. Where sequencesdiffer in conservative substitutions, the percent sequence identity maybe adjusted upwards to correct for the conservative nature of thesubstitution. Means for making this adjustment are well known to thoseof skill in the art. Typically this involves scoring a conservativesubstitution as a partial rather than a full mismatch, therebyincreasing the percentage sequence identity. Thus, for example, where anidentical amino acid is given a score of 1 and a non-conservativesubstitution is given a score of zero, a conservative substitution isgiven a score between zero and 1. The scoring of conservativesubstitutions is calculated according to, e.g., the algorithm of Meyers& Miller, Computer Applic. Biol. Sci. 4:11-17 (1988) e.g., asimplemented in the program PC/GENE (Intelligenetics, Mountain View,Calif., USA).

The phrase “substantially identical,” in the context of two nucleicacids or polypeptides, refers to sequences or subsequences that have atleast 60%, preferably 80%, most preferably 90-95% nucleotide or aminoacid residue identity when aligned for maximum correspondence over acomparison window as measured using one of the following sequencecomparison algorithms or by manual alignment and visual inspection. Thisdefinition also refers to the complement of a test sequence, which hassubstantial sequence or subsequence complementarity when the testsequence has substantial identity to a reference sequence.

For sequence comparison, typically one sequence acts as a referencesequence, to which test sequences are compared. When using a sequencecomparison algorithm, test and reference sequences are entered into acomputer, subsequence coordinates are designated, if necessary, andsequence algorithm program parameters are designated. Default programparameters can be used, or alternative parameters can be designated. Thesequence comparison algorithm then calculates the percent sequenceidentities for the test sequences relative to the reference sequence,based on the program parameters.

A “comparison window”, as used herein, includes reference to a segmentof any one of the number of contiguous positions selected from the groupconsisting of from 20 to 600, usually about 50 to about 200, moreusually about 100 to about 150 in which a sequence may be compared to areference sequence of the same number of contiguous positions after thetwo sequences are optimally aligned. Methods of alignment of sequencesfor comparison are well-known in the art. Optimal alignment of sequencesfor comparison can be conducted, e.g., by the local homology algorithmof Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homologyalignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970),by the search for similarity method of Pearson & Lipman, Proc. Nat'l,Acad. Sci. USA 85:2444 (1988), by computerized implementations of thesealgorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin GeneticsSoftware Package, Genetics Computer Group, 575 Science Dr., Madison,Wis.), or by manual alignment and visual inspection.

One example of a useful algorithm is PILEUP. PILEUP creates a multiplesequence alignment from a group of related sequences using progressive,pairwise alignments to show relationship and percent sequence identity.It also plots a tree or dendogram showing the clustering relationshipsused to create the alignment. PILEUP uses a simplification of theprogressive alignment method of Feng & Doolittle, J. Mol. Evol.35:351-360 (1987). The method used is similar to the method described byHiggins & Sharp, CABIOS 5:151-153 (1989). The program can align up to300 sequences, each of a maximum length of 5,000 nucleotides or aminoacids. The multiple alignment procedure begins with the pairwisealignment of the two most similar sequences, producing a cluster of twoaligned sequences. This cluster is then aligned to the next most relatedsequence or cluster of aligned sequences. Two clusters of sequences arealigned by a simple extension of the pairwise alignment of twoindividual sequences. The final alignment is achieved by a series ofprogressive, pairwise alignments. The program is run by designatingspecific sequences and their amino acid or nucleotide coordinates forregions of sequence comparison and by designating the programparameters. For example, a reference sequence can be compared to othertest sequences to determine the percent sequence identity relationshipusing the following parameters: default gap weight (3.00), default gaplength weight (0.10), and weighted end gaps.

Another example of algorithm that is suitable for determining percentsequence identity and sequence similarity is the BLAST algorithm, whichis described in Altschul et al., J. Mol. Biol. 215:403-410 (1990).Software for performing BLAST analyses is publicly available through theNational Center for Biotechnology Information(http://www.ncbi.nlm.nih.gov/). This algorithm involves firstidentifying high scoring sequence pairs (HSPs) by identifying shortwords of length W in the query sequence, which either match or satisfysome positive-valued threshold score T when aligned with a word of thesame length in a database sequence. T is referred to as the neighborhoodword score threshold (Altschul et al, supra). These initial neighborhoodword hits act as seeds for initiating searches to find longer HSPscontaining them. The word hits are extended in both directions alongeach sequence for as far as the cumulative alignment score can beincreased. Extension of the word hits in each direction are halted when:the cumulative alignment score falls off by the quantity X from itsmaximum achieved value; the cumulative score goes to zero or below, dueto the accumulation of one or more negative-scoring residue alignments;or the end of either sequence is reached. The BLAST algorithm parametersW, T, and X determine the sensitivity and speed of the alignment. TheBLAST program uses as defaults a wordlength (W) of 11, the BLOSUM62scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA89:10915 (1989)) alignments (B) of 50, expectation (E) of 10, M=5, N=−4,and a comparison of both strands.

The BLAST algorithm also performs a statistical analysis of thesimilarity between two sequences (see, e.g., Karlin & Altschul, Proc.Nat'l. Acad. Sci. USA 90:5873-5787 (1993)). One measure of similarityprovided by the BLAST algorithm is the smallest sum probability (P(N)),which provides an indication of the probability by which a match betweentwo nucleotide or amino acid sequences would occur by chance. Forexample, a nucleic acid is considered similar to a reference sequence ifthe smallest sum probability in a comparison of the test nucleic acid tothe reference nucleic acid is less than about 0.2, more preferably lessthan about 0.01, and most preferably less than about 0.001.

“Conservatively modified variants” applies to both amino acid andnucleic acid sequences. With respect to particular nucleic acidsequences, conservatively modified variants refers to those nucleicacids which encode identical or essentially identical amino acidsequences, or where the nucleic acid does not encode an amino acidsequence, to essentially identical sequences. Because of the degeneracyof the genetic code, a large number of functionally identical nucleicacids encode any given protein. For instance, the codons GCA, GCC, GCGand GCU all encode the amino acid alanine. Thus, at every position wherean alanine is specified by a codon, the codon can be altered to any ofthe corresponding codons described without altering the encodedpolypeptide. Such nucleic acid variations are “silent variations,” whichare one species of conservatively modified variations. Every nucleicacid sequence herein which encodes a polypeptide also describes everypossible silent variation of the nucleic acid. One of skill willrecognize that each codon in a nucleic acid (except AUG, which isordinarily the only codon for methionine) can be modified to yield afunctionally identical molecule. Accordingly, each silent variation of anucleic acid which encodes a polypeptide is implicit in each describedsequence.

As to amino acid sequences, one of skill will recognize that individualsubstitutions, deletions or additions to a nucleic acid, peptide,polypeptide, or protein sequence which alters, adds or deletes a singleamino acid or a small percentage of amino acids in the encoded sequenceis a “conservatively modified variant” where the alteration results inthe substitution of an amino acid with a chemically similar amino acid.Conservative substitution tables providing functionally similar aminoacids are well known in the art.

The following six groups each contain amino acids that are conservativesubstitutions for one another:

1) Alanine (A), Serine (S), Threonine (T);

2) Aspartic acid (D), Glutamic acid (E);

3) Asparagine (N), Glutamine (Q);

4) Arginine (R), Lysine (K);

5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and

6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W). (see, e.g.,Creighton, Proteins (1984)).

An indication that two nucleic acid sequences or polypeptides aresubstantially identical is that the polypeptide encoded by the firstnucleic acid is immunologically cross reactive with the antibodiesraised against the polypeptide encoded by the second nucleic acid. Thus,a polypeptide is typically substantially identical to a secondpolypeptide, for example, where the two peptides differ only byconservative substitutions. Another indication that two nucleic acidsequences are substantially identical is that the two molecules or theircomplements hybridize to each other under stringent conditions, asdescribed below.

The phrase “selectively (or specifically) hybridizes to” refers to thebinding, duplexing, or hybridizing of a molecule only to a particularnucleotide sequence under stringent hybridization conditions when thatsequence is present in a complex mixture (e.g., total cellular orlibrary DNA or RNA).

The phrase “stringent hybridization conditions” refers to conditionsunder which a probe will hybridize to its target subsequence, typicallyin a complex mixture of nucleic acid, but to no other sequences.Stringent conditions are sequence-dependent and will be different indifferent circumstances. Longer sequences hybridize specifically athigher temperatures. An extensive guide to the hybridization of nucleicacids is found in Tijssen, Techniques in Biochemistry and MolecularBiology—Hybridization with Nucleic Probes, “Overview of principles ofhybridization and the strategy of nucleic acid assays” (1993).Generally, highly stringent conditions are selected to be about 5-10° C.lower than the thermal melting point (T_(m)) for the specific sequenceat a defined ionic strength pH. Low stringency conditions are generallyselected to be about 15-30° C. below the T_(m). The T_(m) is thetemperature (under defined ionic strength, pH, and nucleicconcentration) at which 50% of the probes complementary to the targethybridize to the target sequence at equilibrium (as the target sequencesare present in excess, at T_(m), 50% of the probes are occupied atequilibrium). Stringent conditions will be those in which the saltconcentration is less than about 1.0M sodium ion, typically about 0.01to 1.0M sodium ion concentration (or other salts) at pH 7.0 to 8.3 andthe temperature is at least about 30° C. for short probes (e.g., 10 to50 nucleotides) and at least about 60° C. for long probes (e.g., greaterthan 50 nucleotides). Stringent conditions may also be achieved with theaddition of destabilizing agents such as formamide. For selective orspecific hybridization, a positive signal is at least two timesbackground, preferably 10 time background hybridization.

Nucleic acids that do not hybridize to each other under stringentconditions are still substantially identical if the polypeptides whichthey encode are substantially identical. This occurs, for example, whena copy of a nucleic acid is created using the maximum codon degeneracypermitted by the genetic code. In such cased, the nucleic acidstypically hybridize under moderately stringent hybridization conditions.

In the present invention, genomic DNA or cDNA comprising FIE nucleicacids of the invention can be identified in standard Southern blotsunder stringent conditions using the nucleic acid sequences disclosedhere. For the purposes of this disclosure, suitable stringent conditionsfor such hybridizations are those which include a hybridization in abuffer of 40% formamide, 1M NaCl, 1% SDS at 37° C., and at least onewash in 0.2X SSC at a temperature of at least about 50° C., usuallyabout 55 ° C. to about 60° C., for 20 minutes, or equivalent conditions.A positive hybridization is at least twice background. Those of ordinaryskill will readily recognize that alternative hybridization and washconditions can be utilized to provide conditions of similar stringency.

A further indication that two polynucleotides are substantiallyidentical is if the reference sequence, amplified by a pair ofoligonucleotide primers, can then be used as a probe under stringenthybridization conditions to isolate the test sequence from a cDNA orgenomic library, or to identify the test sequence in, e.g., a northernor Southern blot.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1A and 1B show the genetic map used to clone the FIE3 gene.

FIG. 2 shows the analysis of the sequence in the DNA shown in FIG. 1using the GENSCANW program.

FIG. 3 shows the position of primers used to PCR amplify sequences fromthe FIE3 gene region.

FIG. 4 shows the genetic map used to clone the FIE1 gene.

FIG. 5 shows the results of complementation tests establishing that asingle gene (FIE1) was present on the complementing cosmid (6-22) thatwas not fully encoded on either of the non-complementing cosmids (2-9and 2-8).

DESCRIPTION OF THE PREFERRED EMBODIMENTS

This invention provides molecular strategies for controlling seed andfruit development.

Reproduction in higher plants is unique because it is initiated by twofertilization events in the haploid female gametophyte. One spermnucleus fertilizes the egg to form the embryo. A second sperm nucleusfertilizes the central cell to form the endosperm, a unique tissue thatsupports the growth of the embryo. Fertilization also activates maternaltissue differentiation, the ovule integuments form the seed coat and theovary forms the fruit.

The present invention is based, at least in part, on the discovery of aset of female-gametophytic mutations, termed fie(fertilization-independent endosperm), and the subsequent cloning of thegenes involved. Three mutants are disclosed here fie1, fie2, and fie3,which have been mapped to chromosomes 1, 2, and 3 of Arabidopsis,respectively. The fie mutations affect the central cell, allowing forreplication of the central cell nucleus and endosperm developmentwithout fertilization. FIE/fie seed coat and fruit undergofertilization-independent differentiation, showing that the fie femalegametophyte is the source of signals that activates sporophytic fruitand seed coat development. Generally, the mutant fie alleles are nottransmitted by the female gametophyte. Inheritance of a mutant fieallele (e.g., fie3) by the female gametophyte usually results in embryoabortion, even when the pollen bears the wild-type FIE allele. In thecase of fie1 and fie2, however, transmission of the trait occurs inabout 1% of the progeny from the female gametophyte. In contrast, thefie1, fie2, and fie3 mutant alleles are passed through the malegametophyte (i.e., pollen) in normal fashion.

The isolated sequences prepared as described herein, can be used in anumber of techniques, for example, to suppress or enhance endogenous FIEgene expression. Modulation of FIE gene expression or FIE activity inplants is particularly useful, for example, in producing embryo-lessseed, parthenocarpic fruit, or as part of a system to generate apomicticseed.

Isolation of FIE Nucleic Acids

Generally, the nomenclature and the laboratory procedures in recombinantDNA technology described below are those well known and commonlyemployed in the art. Standard techniques are used for cloning, DNA andRNA isolation, amplification and purification. Generally enzymaticreactions involving DNA ligase, DNA polymerase, restrictionendonucleases and the like are performed according to the manufacturer'sspecifications. These techniques and various other techniques aregenerally performed according to Sambrook et al., Molecular Cloning—ALaboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor,N.Y., (1989).

The isolation of FIE nucleic acids may be accomplished by a number oftechniques. For instance, oligonucleotide probes based on the sequencesdisclosed here can be used to identify the desired gene in a cDNA orgenomic DNA library. To construct genomic libraries, large segments ofgenomic DNA are generated by random fragmentation, e.g. usingrestriction endonucleases, and are ligated with vector DNA to formconcatemers that can be packaged into the appropriate vector. To preparea cDNA library, mRNA is isolated from the desired organ, such as ovules,and a cDNA library which contains the FIE gene transcript is preparedfrom the mRNA. Alternatively, cDNA may be prepared from mRNA extractedfrom other tissues in which FIE genes or homologs are expressed.

The cDNA or genomic library can then be screened using a probe basedupon the sequence of a cloned FIE gene disclosed here. Probes may beused to hybridize with genomic DNA or cDNA sequences to isolatehomologous genes in the same or different plant species. Alternatively,antibodies raised against an FIE polypeptide can be used to screen anmRNA expression library.

Alternatively, the nucleic acids of interest can be amplified fromnucleic acid samples using amplification techniques. For instance,polymerase chain reaction (PCR) technology can be used to amplify thesequences of the FIE genes directly from genomic DNA, from cDNA, fromgenomic libraries or cDNA libraries. PCR and other in vitroamplification methods may also be useful, for example, to clone nucleicacid sequences that code for proteins to be expressed, to make nucleicacids to use as probes for detecting the presence of the desired mRNA insamples, for nucleic acid sequencing, or for other purposes. For ageneral overview of PCR see PCR Protocols: A Guide to Methods andApplications. (Innis, M, Gelfand, D., Sninsky, J. and White, T., eds.),Academic Press, San Diego (1990).

Appropriate primers and probes for identifying FIE sequences from planttissues are generated from comparisons of the sequences provided herewith other polycomb group genes. For instance, FIE1 can be compared tothe other polycomb genes containing the SET domain, such as theArabidopsis curly leaf gene (Goodrich et al. Nature 386:44-51 (1997)) orthe Drosophila enhancer of zeste (E(z)) gene. FIE3 can be compared togenes containing WD40 repeats, such as the extra sex combs (esc) genefrom Drosophila. Using these techniques, one of skill can identifyconserved regions in the nucleic acids disclosed here to prepare theappropriate primer and probe sequences. Primers that specificallyhybridize to conserved regions in FIE1 or FIE3 genes can be used toamplify sequences from widely divergent plant species.

Standard nucleic acid hybridization techniques using the conditionsdisclosed above can then be used to identify full length cDNA or genomicclones.

Control of FIE Activity or Gene Expression

Since FIE genes are involved in controlling seed, in particularendosperm, development, inhibition of endogenous Fie activity or geneexpression is useful in a number of contexts. For instance, inhibitionof expression is useful in the development of parthenocarpic fruit(i.e., fruit formed in the absence of fertilization).

In addition, inhibition of FIE activity can be used for production offruit with small and/or degraded seed (referred to here as “seedlessfruit”) after fertilization. In many plants, particularly dicots, theendosperm is not persistent and eventually is degraded. Thus, in plantsof the invention in which Fie activity is inhibited, embryo-less seed donot persist and seedless fruit are produced.

Alternatively, plants of the invention can be used to preventpre-harvest sprouting in seeds, especially those derived from cereals.In these plants, the endosperm persists and is the major component ofthe mature seed. Premature growth of embryos in stored grain causesrelease of degradative enzymes which digest starch and other componentsof the endosperm. Plants of the present invention are useful inaddressing this problem because the seeds lack an embryo and thus willnot germinate.

In yet another use, nucleic acids of the invention can be used in thedevelopment of apomictic plant lines (i.e., plants in which asexualreproductive processes occur in the ovule, see, Koltunow, A. Plant Cell5: 1425-1437 (1993) for a discussion of apomixis). Apomixis provides anovel means to select and fix complex heterozygous genotypes that cannotbe easily maintained by traditional breeding. Thus, for instance, newhybrid lines with desired traits (e.g., hybrid vigor) can be obtainedand readily maintained.

In still another use, nucleic acids of the invention can be used tocontrol endosperm production in transgenic plants. In particular,inhibition of FIE activity can be used to produce larger seeds withincreased endosperm. This trait is particularly useful in species inwhich the endosperm persists in the seed (e.g., monocots, particularlygrains).

One of skill will recognize that a number of methods can be used tomodulate FIE activity or gene expression. FIE activity can be modulatedin the plant cell at the gene, transcriptional, posttranscriptional,translational, or posttranslational, levels. Techniques for modulatingFIE activity at each of these levels are generally well known to one ofskill and are discussed briefly below.

Methods for introducing genetic mutations into plant genes are wellknown. For instance, seeds or other plant material can be treated with amutagenic chemical substance, according to standard techniques. Suchchemical substances include, but are not limited to, the following:diethyl sulfate, ethylene imine, ethyl methanesulfonate andN-nitroso-N-ethylurea. Alternatively, ionizing radiation from sourcessuch as, for example, X-rays or gamma rays can be used.

Alternatively, homologous recombination can be used to induce targetedgene disruptions by specifically deleting or altering the FIE gene invivo (see, generally, Grewal and Klar, Genetics 146: 1221-1238 (1997)and Xu et al., Genes Dev. 10: 2411-2422 (1996)). Homologousrecombination has been demonstrated in plants (Puchta et al.,Experientia 50: 277-284 (1994), Swoboda et al., EMBO J. 13: 484-489(1994); Offringa et al., Proc. Natl. Acad. Sci. USA 90: 7346-7350(1993); and Kempin et al. Nature 389:802-803 (1997)).

In applying homologous recombination technology to the genes of theinvention, mutations in selected portions of an FIE gene sequences(including 5′ upstream, 3′ downstream, and intragenic regions) such asthose disclosed here are made in vitro and then introduced into thedesired plant using standard techniques. Since the efficiency ofhomologous recombination is known to be dependent on the vectors used,use of dicistronic gene targeting vectors as described by Mountford etal. Proc. Natl. Acad. Sci. USA 91: 4303-4307 (1994); and Vaulont et al.Transgenic Res. 4: 247-255 (1995) are conveniently used to increase theefficiency of selecting for altered FIE gene expression in transgenicplants. The mutated gene will interact with the target wild-type gene insuch a way that homologous recombination and targeted replacement of thewild-type gene will occur in transgenic plant cells, resulting insuppression of FIE activity.

Alternatively, oligonucleotides composed of a contiguous stretch of RNAand DNA residues in a duplex conformation with double hairpin caps onthe ends can be used. The RNA/DNA sequence is designed to align with thesequence of the target FIE gene and to contain the desired nucleotidechange. Introduction of the chimeric oligonucleotide on anextrachromosomal T-DNA plasmid results in efficient and specific FIEgene conversion directed by chimeric molecules in a small number oftransformed plant cells. This method is described in Cole-Strauss et al.Science 273:1386-1389 (1996) and Yoon et al. Proc. Natl. Acad. Sci. USA93: 2071-2076 (1996).

Gene expression can be inactivated using recombinant DNA techniques bytransforming plant cells with constructs comprising transposons or T-DNAsequences. FIE mutants prepared by these methods are identifiedaccording to standard techniques. For instance, mutants can be detectedby PCR or by detecting the presence or absence of FIE mRNA, e.g., byNorthern blots. Mutants can also be selected by assaying for developmentof endosperm in the absence of fertilization.

The isolated nucleic acid sequences prepared as described herein, canalso be used in a number of techniques to control endogenous FIE geneexpression at various levels. Subsequences from the sequences disclosedhere can be used to control, transcription, RNA accumulation,translation, and the like.

A number of methods can be used to inhibit gene expression in plants.For instance, antisense technology can be conveniently used. Toaccomplish this, a nucleic acid segment from the desired gene is clonedand operably linked to a promoter such that the antisense strand of RNAwill be transcribed. The construct is then transformed into plants andthe antisense strand of RNA is produced. In plant cells, it has beensuggested that antisense suppression can act at all levels of generegulation including suppression of RNA, translation (see, Bourque PlantSci. (Limerick) 105: 125-149 (1995); Pantopoulos In Progress in NucleicAcid Research and Molecular Biology, Vol. 48. Cohn, W. E. and K. Moldave(Ed.). Academic Press, Inc.: San Diego, Calif., USA; London, England,UK. p. 181-238; Heiser et al. Plant Sci. (Shannon) 127: 61-69 (1997))and by preventing the accumulation of mRNA which encodes the protein ofinterest, (see, Baulcombe Plant Mol. Bio. 32:79-88 (1996); Prins andGoldbach Arch. Virol. 141: 2259-2276 (1996); Metzlaff et al. Cell 88:845-854 (1997), Sheehy et al., Proc. Nat. Acad. Sci. USA, 85:8805-8809(1988), and Hiatt et al., U.S. Pat. No. 4,801,340).

The nucleic acid segment to be introduced generally will besubstantially identical to at least a portion of the endogenous FIE geneor genes to be repressed. The sequence, however, need not be perfectlyidentical to inhibit expression. The vectors of the present inventioncan be designed such that the inhibitory effect applies to other geneswithin a family of genes exhibiting homology or substantial homology tothe target gene.

For antisense suppression, the introduced sequence also need not be fulllength relative to either the primary transcription product or fullyprocessed mRNA. Generally, higher homology can be used to compensate forthe use of a shorter sequence. Furthermore, the introduced sequence neednot have the same intron or exon pattern, and homology of non-codingsegments may be equally effective. Normally, a sequence of between about30 or 40 nucleotides and about full length nucleotides should be used,though a sequence of at least about 100 nucleotides is preferred, asequence of at least about 200 nucleotides is more preferred, and asequence of about 500 to about 1700 nucleotides is especially preferred.

A number of gene regions can be targeted to suppress FIE geneexpression. The targets can include, for instance, the coding regions,introns, sequences from exon/intron junctions, 5′ or 3′ untranslatedregions, and the like. In some embodiments, the constructs can bedesigned to eliminate the ability of regulatory proteins to bind to FIEgene sequences that are required for its cell- and/or tissue-specificexpression. Such transcriptional regulatory sequences can be locatedeither 5′-, 3′-, or within the coding region of the gene and can beeither promote (positive regulatory element) or repress (negativeregulatory element) gene transcription. These sequences can beidentified using standard deletion analysis, well known to those ofskill in the art. Once the sequences are identified, an antisenseconstruct targeting these sequences is introduced into plants to controlgene transcription in particular tissue, for instance, in developingovules and/or seed.

Oligonucleotide-based triple-helix formation can be used to disrupt FIEgene expression. Triplex DNA can inhibit DNA transcription andreplication, generate site-specific mutations, cleave DNA, and inducehomologous recombination (see, e.g., Havre and Glazer J. Virology67:7324-7331 (1993); Scanlon et al. FASEB J. 9:1288-1296 (1995);Giovannangeli et al. Biochemistry 35:10539-10548 (1996); Chan and GlazerJ. Mol. Medicine (Berlin) 75: 267-282 (1997)). Triple helix DNAs can beused to target the same sequences identified for antisense regulation.

Catalytic RNA molecules or ribozymes can also be used to inhibitexpression of FIE genes. It is possible to design ribozymes thatspecifically pair with virtually any target RNA and cleave thephosphodiester backbone at a specific location, thereby functionallyinactivating the target RNA. In carrying out this cleavage, the ribozymeis not itself altered, and is thus capable of recycling and cleavingother molecules, making it a true enzyme. The inclusion of ribozymesequences within antisense RNAs confers RNA-cleaving activity upon them,thereby increasing the activity of the constructs. Thus, ribozymes canbe used to target the same sequences identified for antisenseregulation.

A number of classes of ribozymes have been identified. One class ofribozymes is derived from a number of small circular RNAs which arecapable of self-cleavage and replication in plants. The RNAs replicateeither alone (viroid RNAs) or with a helper virus (satellite RNAs).Examples include RNAs from avocado sunblotch viroid and the satelliteRNAs from tobacco ringspot virus, lucerne transient streak virus, velvettobacco mottle virus, solanum nodiflorum mottle virus and subterraneanclover mottle virus. The design and use of target RNA-specific ribozymesis described in Zhao and Pick Nature 365:448-451 (1993); Eastham andAhlering J. Urology 156:1186-1188 (1996); Sokol and Murray TransgenicRes. 5:363-371 (1996); Sun et al. Mol. Biotechnology 7:241-251 (1997);and Haseloff et al. Nature, 334:585-591 (1988).

Another method of suppression is sense cosuppression. Introduction ofnucleic acid configured in the sense orientation has been recently shownto be an effective means by which to block the transcription of targetgenes. For an example of the use of this method to modulate expressionof endogenous genes (see, Assaad et al. Plant Mol. Bio. 22: 1067-1085(1993); Flavell Proc. Natl. Acad. Sci. USA 91: 3490-3496 (1994); Stam etal. Annals Bot. 79: 3-12 (1997); Napoli et al., The Plant Cell 2:279-289(1990); and U.S. Pat. Nos. 5,034,323, 5,231,020, and 5,283,184).

The suppressive effect may occur where the introduced sequence containsno coding sequence per se, but only intron or untranslated sequenceshomologous to sequences present in the primary transcript of theendogenous sequence. The introduced sequence generally will besubstantially identical to the endogenous sequence intended to berepressed. This minimal identity will typically be greater than about65%, but a higher identity might exert a more effective repression ofexpression of the endogenous sequences. Substantially greater identityof more than about 80% is preferred, though about 95% to absoluteidentity would be most preferred. As with antisense regulation, theeffect should apply to any other proteins within a similar family ofgenes exhibiting homology or substantial homology.

For sense suppression, the introduced sequence, needing less thanabsolute identity, also need not be full length, relative to either theprimary transcription product or fully processed mRNA. This may bepreferred to avoid concurrent production of some plants which areoverexpressers. A higher identity in a shorter than full length sequencecompensates for a longer, less identical sequence. Furthermore, theintroduced sequence need not have the same intron or exon pattern, andidentity of non-coding segments will be equally effective. Normally, asequence of the size ranges noted above for antisense regulation isused. In addition, the same gene regions noted for antisense regulationcan be targetted using cosuppression technologies.

Alternatively, FIE activity may be modulated by eliminating the proteinsthat are required for FIE cell-specific gene expression. Thus,expression of regulatory proteins and/or the sequences that control FIEgene expression can be modulated using the methods described here.

Another method is use of engineered tRNA suppression of FIE mRNAtranslation. This method involves the use of suppressor tRNAs totransactivate target genes containing premature stop codons (see,Betzner et al. Plant J.11:587-595 (1997); and Choisne et al. Plant J.11:597-604 (1997). A plant line containing a constitutively expressed FIEgene that contains an amber stop codon is first created. Multiple linesof plants, each containing tRNA suppressor gene constructs under thedirection of cell-type specific promoters are also generated. The tRNAgene construct is then crossed into the FIE line to activate FIEactivity in a targeted manner. These tRNA suppressor lines could also beused to target the expression of any type of gene to the same cell ortissue types.

As noted above, FIE proteins as products of polycomb group genes arebelieved to form large complexes in vivo. Thus, production ofdominant-negative forms of FIE polypeptides that are defective in theirabilities to bind to other polycomb group proteins is a convenient meansto inhibit endogenous FIE activity. This approach involvestransformation of plants with constructs encoding mutant FIEpolypeptides that form defective complexes with endogenous polycombgroup proteins and thereby prevent the complex from forming properly.The mutant polypeptide may vary from the naturally occurring sequence atthe primary structure level by amino acid substitutions, additions,deletions, and the like. These modifications can be used in a number ofcombinations to produce the final modified protein chain. Use ofdominant negative mutants to inactivate target genes is described inMizukami et al. Plant Cell 8:831-845 (1996).

Another strategy to affect the ability of an FIE protein to interactwith itself or with other proteins involves the use of antibodiesspecific to FIE. In this method cell-specific expression of FIE-specificAbs is used inactivate functional domains through antibody:antigenrecognition (see, Hupp et al. Cell 83:237-245 (1995)).

Use of Nucleic Acids of the Invention to Enhance FIE Gene Expression

Isolated sequences prepared as described herein can also be used tointroduce expression of a particular FIE nucleic acid to enhance orincrease endogenous gene expression. For instance, polycomb genes areknown to control cell cycling. Enhanced expression can therefore be usedto control plant morphology by controlling whether or not cell divisiontakes place in desired tissues or cells. Enhanced expression can also beused, for instance, to increase vegetative growth by preventing theplant from setting seed. Where overexpression of a gene is desired, thedesired gene from a different species may be used to decrease potentialsense suppression effects.

One of skill will recognize that the polypeptides encoded by the genesof the invention, like other proteins, have different domains whichperform different functions. Thus, the gene sequences need not be fulllength, so long as the desired functional domain of the protein isexpressed.

Modified protein chains can also be readily designed utilizing variousrecombinant DNA techniques well known to those skilled in the art anddescribed in detail, below. For example, the chains can vary from thenaturally occurring sequence at the primary structure level by aminoacid substitutions, additions, deletions, and the like. Thesemodifications can be used in a number of combinations to produce thefinal modified protein chain.

Preparation of Recombinant Vectors

To use isolated sequences in the above techniques, recombinant DNAvectors suitable for transformation of plant cells are prepared.Techniques for transforming a wide variety of higher plant species arewell known and described in the technical and scientific literature.See, for example, Weising et al. Ann. Rev. Genet. 22:421-477 (1988). ADNA sequence coding for the desired polypeptide, for example a cDNAsequence encoding a full length protein, will preferably be combinedwith transcriptional and translational initiation regulatory sequenceswhich will direct the transcription of the sequence from the gene in theintended tissues of the transformed plant.

For example, for overexpression, a plant promoter fragment may beemployed which will direct expression of the gene in all tissues of aregenerated plant. Such promoters are referred to herein as“constitutive” promoters and are active under most environmentalconditions and states of development or cell differentiation. Examplesof constitutive promoters include the cauliflower mosaic virus (CaMV)35S transcription initiation region. the 1′- or 2′- promoter derivedfrom T-DNA of Agrobacterium tumafaciens, and other transcriptioninitiation regions from various plant genes known to those of skill.Such genes include for example, ACT11 from Arabidopsis (Huang et al.Plant Mol. Biol. 33:125-139 (1996)), Cat3 from Arabidopsis (GenBank No.U43147, Zhong et al., Mol. Gen. Genet. 251:196-203 (1996)), the geneencoding stearoyl-acyl carrier protein desaturase from Brassica napus(Genbank No. X74782, Solocombe et al. Plant Physiol. 104:1167-1176(1994)), GPc1 from maize (GenBank No. X15596, Martinez et al. J. Mol.Biol 208:551-565 (1989)), and Gpc2 from maize (GenBank No. U45855,Manjunath et al., Plant Mol. Biol. 33:97-112 (1997)).

Alternatively, the plant promoter may direct expression of the FIEnucleic acid in a specific tissue or may be otherwise under more preciseenvironmental or developmental control. Examples of environmentalconditions that may effect transcription by inducible promoters includeanaerobic conditions, elevated temperature, or the presence of light.Such promoters are referred to here as “inducible” or “tissue-specific”promoters. One of skill will recognize that a tissue-specific promotermay drive expression of operably linked sequences in tissues other thanthe target tissue. Thus, as used herein a tissue-specific promoter isone that drives expression preferentially in the target tissue, but mayalso lead to some expression in other tissues as well.

Examples of promoters under developmental control include promoters thatinitiate transcription only (or primarily only) in certain tissues, suchas fruit, seeds, or flowers. Promoters that direct expression of nucleicacids in ovules, flowers or seeds are particularly useful in the presentinvention. As used herein a seed-specific promoter is one which directsexpression in seed tissues, such promoters may be, for example,ovule-specific (which includes promoters which direct expression inmaternal tissues or the female gametophyte, such as egg cells or thecentral cell), embryo-specific, endosperm-specific, integument-specific,seed coat-specific, or some combination thereof. Examples include apromoter from the ovule-specific BEL1 gene described in Reiser et al.Cell 83:735-742 (1995) (GenBank No. U39944). Other suitable seedspecific promoters are derived from the following genes: MAC1 from maize(Sheridan et al. Genetics 142:1009-1020 (1996), Cat3 from maize (GenBankNo. L05934, Abler et al. Plant Mol. Biol. 22:10131-1038 (1993), the geneencoding oleosin 18kD from maize (GenBank No. J05212, Lee et al. PlantMol. Biol. 26:1981-1987 (1994)), vivparous-1 from Arabidopsis (GenbankNo. U93215), the gene encoding oleosin from Arabidopsis (Genbank No.Z17657), Atmycl from Arabidopsis (Urao et al. Plant Mol. Biol.32:571-576 (1996), the 2s seed storage protein gene family fromArabidopsis (Conceicao et al. Plant 5:493-505 (1994)) the gene encodingoleosin 20kD from Brassica napus (GenBank No. M63985), napA fromBrassica napus (GenBank No. J02798, Josefsson et al. JBL 26:12196-1301(1987), the napin gene family from Brassica napus (Sjodahl et al. Planta197:264-271 (1995), the gene encoding the 2S storage protein fromBrassica napus (Dasgupta et al. Gene 133:301-302 (1993)), the genesencoding oleosin A (Genbank No. U09118) and oleosin B (Genbank No.U09119) from soybean and the gene encoding low molecular weight sulphurrich protein from soybean (Choi et al. Mol Gen, Genet. 246:266-268(1995)).

In addition, the promoter sequences from the FIE genes disclosed herecan be used to drive expression of the FIE polynucleotides of theinvention or heterologous sequences. The sequences of the promoters areidentified below.

If proper polypeptide expression is desired, a polyadenylation region atthe 3′-end of the coding region should be included. The polyadenylationregion can be derived from the natural gene, from a variety of otherplant genes, or from T-DNA.

The vector comprising the sequences (e.g., promoters or coding regions)from genes of the invention will typically comprise a marker gene whichconfers a selectable phenotype on plant cells. For example, the markermay encode biocide resistance, particularly antibiotic resistance, suchas resistance to kanamycin, G418, bleomycin, hygromycin, or herbicideresistance, such as resistance to chlorosulfuron or Basta.

Production of Transgenic Plants

DNA constructs of the invention may be introduced into the genome of thedesired plant host by a variety of conventional techniques. For example,the DNA construct may be introduced directly into the genomic DNA of theplant cell using techniques such as electroporation and microinjectionof plant cell protoplasts, or the DNA constructs can be introduceddirectly to plant tissue using ballistic methods, such as DNA particlebombardment.

Microinjection techniques are known in the art and well described in thescientific and patent literature. The introduction of DNA constructsusing polyethylene glycol precipitation is described in Paszkowski etal. Embo J. 3:2717-2722 (1984). Electroporation techniques are describedin Fromm et al. Proc. Natl. Acad. Sci. USA 82:5824 (1985). Ballistictransformation techniques are described in Klein et al. Nature 327:70-73(1987).

Alternatively, the DNA constructs may be combined with suitable T-DNAflanking regions and introduced into a conventional Agrobacteriumtumefaciens host vector. The virulence functions of the Agrobacteriumtumefaciens host will direct the insertion of the construct and adjacentmarker into the plant cell DNA when the cell is infected by thebacteria. Agrobacterium tumefaciens-mediated transformation techniques,including disarming and use of binary vectors, are well described in thescientific literature. See, for example Horsch et al. Science233:496-498 (1984), and Fraley et al. Proc. Natl. Acad. Sci. USA 80:4803(1983).

Transformed plant cells which are derived by any of the abovetransformation techniques can be cultured to regenerate a whole plantwhich possesses the transformed genotype and thus the desired phenotypesuch as increased seed mass. Such regeneration techniques rely onmanipulation of certain phytohormones in a tissue culture growth medium,typically relying on a biocide and/or herbicide marker which has beenintroduced together with the desired nucleotide sequences. Plantregeneration from cultured protoplasts is described in Evans et al.,Protoplasts Isolation and Culture, Handbook of Plant Cell Culture, pp.124-176, MacMillilan Publishing Company, New York, 1983; and Binding,Regeneration of Plants, Plant Protoplasts, pp. 21-73, CRC Press, BocaRaton, 1985. Regeneration can also be obtained from plant callus,explants, organs, or parts thereof. Such regeneration techniques aredescribed generally in Klee et al. Ann. Rev. of Plant Phys. 38:467-486(1987).

The nucleic acids of the invention can be used to confer desired traitson essentially any plant. Thus, the invention has use over a broad rangeof plants, including species from the genera Anacardium, Arachis,Asparagus, Atropa, Avena, Brassica, Citrus. Citrullus, Capsicum,Carthamus, Cocos, Coffea, Cucumis, Cucurbita, Daucus, Elaeis, Fragaria,Glycine, Gossypium, Helianthus, Heterocallis, Hordeum, Hyoscyamus,Lactuca, Linum, Lolium, Lupinus, Lycopersicon, Malus, Manihot, Majorana,Medicago, Nicotiana, Olea, Oryza, Panieum, Pannesetum, Persea,Phaseolus, Pistachia, Pisum, Pyrus, Prunus, Raphanus, Ricinus, Secale,Senecio, Sinapis, Solanum, Sorghum, Theobromus, Trigonella, Triticum,Vicia, Vitis, Vigna, and Zea.

One of skill will recognize that after the expression cassette is stablyincorporated in transgenic plants and confirmed to be operable, it canbe introduced into other plants by sexual crossing. Any of a number ofstandard breeding techniques can be used, depending upon the species tobe crossed.

Seed obtained from plants of the present invention can be analyzedaccording to well known procedures to identify plants with the desiredtrait. If antisense or other techniques are used to control Fie geneexpression, Northern blot analysis can be used to screen for desiredplants. In addition, the presence of fertilization independentreproductive development can be detected. Plants can be screened, forinstance, for the ability to form embryo-less seed, form seed that abortafter fertilization, or set fruit in the absence of fertilization. Theseprocedures will depend, part on the particular plant species being used,but will be carried out according to methods well known to those ofskill.

The following Examples are offered by way of illustration, notlimitation.

EXAMPLE 1

The following example describes methods used to identify the fiemutants. The methods described here are generally as described in Ohadet al., Proc. Natl. Acad. Sci. USA 93:5319-5324 (1996).

Materials and Methods

Growth and Phenotype of Plants

Plants were grown under low humidity conditions (less than 50%) in glasshouses under 16 hr light/8 hr dark photoperiods generated bysupplemental lighting. Plants were grown at high humidity (greater than80%) in a lighted incubator (Percival, Boone, Iowa).

To test for fertilization-independent development, flower buds fromplants that had not yet begun to shed pollen (stage 12; (Smyth, D. R.,et al., Plant Cell 2: 755-761 (1990))) were opened, immature antherswere removed, and the flower bud was covered with a plastic bag. Sevendays later, the silique was measured, dissected, and the number ofseed-like structures and degenerating ovules were counted. To determinethe frequency of seed abortion following fertilization, siliques wereharvested 10 days after self-pollination, dissected, and wild-type andaborted seeds were counted.

Genetic Mapping

Heterozygous FIE/fie (Landsberg erecta ecotype) plants were crossed asmales with female plants (Columbia ecotype). Because the mutant fieallele is only transmitted through the male gametophyte, FIE/fie progenywere crossed as males a second time to female g11/g11 (Columbia ecotype)plants. Approximately fifty-five progeny were scored for the segregationof the wild-type FIE and mutant fie alleles and for alleles of molecularmarkers as described previously (Bell, C., et al., Genomics 19: 137-144(1994)). This analysis indicated that fie3 is located at approximatelyposition 30 on chromosome three, fie2 is located at approximatelyposition 65 on chromosome two, and fie1 is located at approximatelyposition 2 on chromosome one. Genetic recombination frequencies and mapdistances were calculated according to Koornneef and Stam (Koornneef,M., et al., Methods in Arabidopsis Research, pp. 83-99 (1992)) andKosambi (Kosambi, Ann. Eugen., 12: 172-175 (1944)).

Light Microscopy

Nomarski photographs of whole-mount embryos and endosperm were obtainedby fixing longitudinally slit siliques in an ethanol:acetic acid (9:1)solution overnight, followed by two washes in 90% and 70% ethanol,respectively. Siliques were cleared with a chloralhydrate:glycerol:water solution (8:1:2, w:v:v) (Berleth, T., et al.,Devel 118: 575-587 (1993)). Whole mount preparations were fixed andstained with hematoxylin (Beeckman, T., et al., Plant Mol Biol Rep 12:37-42 (1994)). Embryo and endosperm were photographed with a ZeissAxioskop microscope (Carl Zeiss, Inc., Oberkochen, Germany) usingNomarski optics that permits visualization of optical sections withinthe seed.

GUS Histochemical Assays

GUS activity was detected histochemically as described previously by(Beeckman, T., et al., Plant Mol Biol Rep 12: 37-42 (1994)).

Image Processing

Photographs were scanned using a Microtek scanner. Pictures wereprocessed for publication using Adobe Photoshop 3.0 and printed on aTektronix Phaser 400 color printer.

Results

Isolation of Mutant Lines

To begin to understand mechanisms that initiate reproductivedevelopment, we generated mutant Arabidopsis plants that undergo severalreproductive processes in the absence of fertilization. Arabidopsisplants homozygous for the conditional male sterile popl mutation(Preuss, D., et al., Genes and Devel 7: 974-985 (1993)) were used as theparental strain (Landsberg erecta ecotype). Fertility in pop1 plants issensitive to humidity because pop1 pollen do not hydrate properly due toa defect in wax biosynthesis. When grown at permissive condition, highrelative humidity (>80%), popI plants were male fertile and producedlong siliques with many viable seeds. By contrast, when grown atnon-permissive condition, low relative humidity (<50%), pop1 plants weremale sterile and produced short siliques with no seeds. Thus, siliqueelongation is a marker for reproductive events. To isolate mutations,homozygous popl seeds were mutagenized with ethylmethansulfonate (EMS)and approximately 50,000 M1 plants were screened for silique elongationat non-permissive conditions. Rare M1 plants were identified thatdisplayed heterozygous sectors with elongated siliques. These plantswere transferred to permissive conditions to insure the production ofviable M2 seed. Plants from M2 and M3 families grown at non-permissiveconditions were rechecked for non-sectored silique elongation. Toeliminate any effects of the pop1 mutation, or other EMS-induced lesionson the mutant phenotype, mutant plants were backrossed twice, as males,to wild-type plants. After removing the pop1 mutation,fertilization-independent phenotypes were confirmed after manual removalof anthers from immature flowers before pollen was shed. A total oftwelve lines were identified that displayed elongated siliques in theabsence of fertilization.

Fertilization-Independent Endosperm, Seed Coat and Silique Development

In a representative line chosen for further study, heterozygous plantsproduced by back crosses to wild-type plants generated elongatedsiliques after anther removal with numerous seed-like, structures. Theseresults indicated that heterozygous mutant plants were capable ofsilique, elongation and seed-like structure development in the absenceof fertilization. We compared the development of the mutant seed-likestructures to that of wild-type seeds. After fertilization, theendosperm nucleus replicated and daughter nuclei migrated into theexpanding central cell. Ultimately, a syncytium of endosperm nuclei wasproduced. Nuclear divisions of the endosperm preceded the zygoticdivisions that formed the globular stage embryo. Embryo, endosperm orseed coat development did not occur in wild-type plants in the absenceof fertilization. Development of the ovule and female gametophyte inheterozygous mutant plants was normal. Just prior to flower opening,female gametophytes in these plants contained a single, prominentcentral cell nucleus. Subsequently, in the absence of fertilization,central cells with two large nuclei were detected. Further divisionsresulted in the production of additional nuclei that migrated into theexpanded central cell. Later in development, a nuclear syncytium wasformed with abundant endosperm nuclei. These results indicated that thecentral cell in mutant female gametophytes initiated endospermdevelopment in the absence of fertilization. We have named this mutationfie for fertilization-independent endosperm. By contrast, replication ofother nuclei in fie female gametophytes (egg, synergid, or antipodal)was not detected. Thus, the fie mutation specifically affectsreplication of the central cell nucleus.

We analyzed the frequency of multinucleate central cell formation in fiefemale gametophytes by comparing the percentage of multinucleate centralcells at three, five, and six days after emasculation of heterozygousFIE/fie and control wild-type flowers. At each time point, only 3% to 5%of wild-type central cells had more than one nucleus. Because none hadmore than two nuclei, most likely, these represented central cells withhaploid nuclei that had not fused during female gametophyte development.By contrast, the percentage of central cells in female gametophytes fromFIE/fie siliques with two or more nuclei increased from 21% to 47% overthe same time period. These results indicated that the fie mutationcaused a significant increase in formation of multinucleate centralcells in the absence of fertilization. The fact that close to 50% of thefemale gametophytes in heterozygous plants had multinucleate centralcells suggested that fie is a gametophytic mutation because a 1:1segregation of wild-type and mutant fie alleles occurs during meiosis.

We compared the fertilization-independent development of the maternalseed coat in FIE/fie seed-like structures to that of fertilizedwild-type seeds. The seed coat in wild-type Arabidopsis is generated bythe integuments of the ovule and surrounds the developing embryo andendosperm. Similarly, FIE/fie ovule integuments formed a seed coat thatsurrounded the developing mutant endosperm. These results indicated thatthe fie mutation activated both endosperm development and maternalsporophytic seed coat and silique differentiation that supportreproduction. No other effects on sporophytic growth and developmentwere detected in FIE/fie plants.

The fie3 Mutant Allele is not Transmitted by the Female Gametophyte tothe Next Generation

To understand the mode of inheritance of the fie mutation, we analyzedthe progeny of reciprocal crosses. FIE3/fie3 females, crossed towild-type males, produced siliques with approximately equal numbers ofviable seeds with normal green embryos and nonviable white seeds withembryos aborted at the heart stage (344:375, 1:1, c2=1.3, P>0.2). Viableseeds from this cross were germinated and all 120 F1 progeny generatedwere wild-type. That is, none of the F1 progeny had significant levelsof F2 aborted seeds in their siliques after self-pollination. Nor didthe F1 progeny demonstrate fertilization-independent development. Thisindicated that presence of the fie mutant allele in the femalegametophyte, even when the male provided a wild-type allele, resulted inembryo abortion Thus, the fie mutation is not transmitted by the femalegametophyte to the next generation. To study transmission of fie throughthe male gametophyte, we pollinated female wild-type plants with pollenfrom male FIE3/fie3 plants. Siliques from these crosses contained noaborted F1 seed. F1 plants were examined and a 1:1 segregation ofwild-type and FIE3/fie3 genotype was observed (62:58, c2=0.13, P>0.5).This indicated that wild-type and mutant fie3 alleles were transmittedby the male gametophyte with equal efficiency. That is, fie does notaffect male gametophyte, or pollen grain, function. Results fromreciprocal crosses were verified by analyzing the progeny fromself-pollinated FIE3/fie3 plants. Self-pollinated siliques displayed 1:1segregation of normal and aborted seeds (282:286, c2=0.03, P>0.8).Viable seed from self-pollinated siliques were germinated and a 1:1(71:64, c2=0.36, P>0.5) segregation of wild-type and FIE3/fie3 progenywas observed. These results confirmed that inheritance of a fie mutantallele by the female gametophyte resulted in embryo abortion, and thatinheritance of a fie mutant allele by the male gametophyte did notaffect pollen function. Thus, the wild-type FIE3 allele probably carriesout a function unique to the female gametophyte and does not appear tobe needed for male fertility.

In contrast, fie1 and fie2 mutant alleles were transmitted at lowfrequencies (about 1% of normal) through the female gametophyte. In thisway, fie1 homozygous mutants and fie2 homozygous mutants were obtainedthat appeared to display normal vegetative growth and development.

Discussion

In wild-type plants, fertilization initiates embryogenesis and endospermformation, and activates maternal seed coat and silique development. Theresults presented here indicate that specific aspects of plantreproductive development can occur in FIE/fie plants in the absence offertilization. These include silique elongation, seed coat formation,and endosperm development. Morphological analysis shows that earlyaspects of fertilization-independent fie endosperm development closelyresemble fertilized wild-type endosperm development. First, the fiecentral cell nucleus is stimulated to undergo replication. Second,nuclei that are produced migrate from the micropylar end of the centralcell and take up new positions in the central cell. Third, thedeveloping fie central cell expands to form an endosperm cavity. Thus,the requirement for fertilization to initiate these early events inendosperm formation has been eliminated by the fie mutation. Thissuggests that FIE plays a role in a signal transduction pathway thatlinks fertilization with the onset of central cell nuclear replicationand early endosperm development.

Mechanisms for Regulation of Endosperm Development by FIE

One can envision two possible mechanisms for how FIE regulatesreplication of the central cell nucleus in response to fertilization.The protein encoded by the FIE gene may be involved in a positiveregulatory interaction. In this model, FIE is required for the centralcell to initiate endosperm development. Normally, fertilization isneeded for the presence of active FIE protein. The fie mutation resultsin the presence of active protein in the absence of fertilization.Alternatively, FIE may by involved in a negative regulatory interaction.In this model, the function of FIE protein is to prevent the centralcell from initiating endosperm development, and fertilization results inthe inactivation of FIE protein. The fie mutation results in theproduction of inactive protein, so that fertilization is no longerrequired to initiate endosperm development. However, complementationexperiments using transgenic plants indicate that FIE1 and FIE3 allelesare dominant over their respective mutant alleles. This indicates thatthe wild-type allele is involved in a negative regulatory interaction.Recently, it has been shown that cyclin-dependent kinase complexes,related to those that function in mammals, control the induction of DNAsynthesis and mitosis in maize endosperm (Grafi, G.et al., Science269:1262-1264 (1995)). Because fie stimulates replication of the centralcell, fie may, either directly or indirectly, impinge upon cell cyclecontrol of the central cell nucleus, allowing replication to take placein the absence of fertilization.

Communication between the fie Female Gametophyte and the SporophyticOvule and Carpels

The analysis of FIE/fie mutant plants has provided clues aboutinteractions between endosperm and maternal sporophytic tissues. FIE/fieovule integuments surrounding a mutant fie female gametophyte initiateseed coat development, whereas FIE/fie integuments in contact with aquiescent wild-type female gametophyte do not develop. This suggeststhat the FIE/fie ovule integuments initiate seed coat differentiation inresponse to a signal produced by the fie female gametophyte. We proposethat the source of the signal is the mutant fie central cell that hasinitiated endosperm development, although we cannot rule out theparticipation of other cells in the fie female gametophyte. In wild-typeplants, most likely, fertilization of the central cell produces anendosperm that activates seed coat development. This is consistent withexperiments showing that the maize endosperm interacts with nearbymaternal cells (Miller, M. E., et al., Plant Cell 4: 297-305 (1992)).FIE/fie plants also display fertilization-independent elongation of theovary to form the silique. We propose that a signal is produced by thedeveloping seed-like structures to initiate silique elongation. This isin agreement with experiments suggesting that seeds are the source ofhormones, auxins and gibberellins, that activate fruit development (Lee,T. D. Plant Reproductive Ecology, pp. 179-202 (1988)). Taken together,these results suggest that the fertilized female gametophyte activatesmaternal developmental programs.

Relationship between Fie and Apomixis

Certain plant species display aspects of fertilization-independentreproductive development, including apomictic generation of embryo andendosperm, and development of the maternal seed coat and fruit (reviewedin (Koltunow, a. Plant Cell 5: 1425-1437 (1993)). The fie mutationreveals that Arabidopsis, a sexually reproducing plant, has the geneticpotential for aspects of fertilization-independent reproductivedevelopment. It is not known whether the mechanism offertilization-independent endosperm development conferred by the fiemutation is the same as autonomous endosperm formation observed incertain apomictic plant species. However, the fact that the fiephenotype is caused by a single genetic locus substantiates the viewthat the number of genetic differences between sexually and asexuallyreproducing plants is small (Koltunow, a. M., et al., Plant Physiol108:1345-1352 (1995)).

EXAMPLE 2 This example describes cloning of two Fie genes, Fie1 and Fie3

Cloning the FIE3 Gene

a. Mapping the position of the fie3 gene genetically. The fie3 mutationwas initially mapped to position 30 on chromosome 3, between AXR2 (auxinresistant dwarf) and EMB29 (embryo lethal). Next, two sets of F2 plantswith recombination breakpoints in the fie3 gene region were obtained.One set was between emb29 and fie3 and the other set was between axr2and fie3. As shown in FIG. 1A, these recombinants were used to map thefie3 gene relative to molecular markers (NDR, CH18, CH18S, BO20, AG20,KN1 and E13F12) that were obtained from overlapping YAC (yUP13F12), BAC(T1B4 and T4N1) and cosmid clones (FIG. 1A). YAC and BAC clones wereobtained from the Arabidopsis Stock Center (Ohio State University, USA).Cosmid subclones were generated in my laboratory. As shown in FIGS. 1Aand 1B, this genetic analysis indicates that the fie3 gene resideswithin the 25 Kb region between the BO20 and AG20 markers.

b. Mapping the position of the fie3 gene by complementation experiments.To more precisely localize the fie3 gene, we analyzed a series ofoverlapping cosmid clones (BO20, GM15, AG20 and EI12) that span the fie3gene region. Each cosmid clone was tested for its ability to complementthe fie3 mutation in transgenic plants. Only cosmid GM15 complementedthe fie3 mutation (FIG. 1A). These results indicate that an essentialportion of the fie3 gene is in the 10 Kb region that is unique to cosmidGM15. As shown in FIG. 1B, we have cloned DNA that spans this essentialportion of the fie3 gene and have determined its DNA sequence. As shownin FIG. 2, analysis of the sequence using the GENSCANW program revealeda gene with an open reading frame. The predicted cDNA sequence andpredicted amino acid sequence are shown in SEQ ID NO:3 and SEQ ID NO:4,respectively. Comparing the predicted amino acid sequence to those inpublic data bases revealed significant homology to the WD40 family ofPolycomb Group genes, and in particular, the “extra sex combs” gene inDrosophila. FIG. 3 shows the position of primers used to PCR amplifythis region. SEQ ID NO:5 provides the genomic DNA sequence of theWD40/Polycomb gene, plus approximately 3.8 Kb of 5′-flanking sequencesand 0.3 Kb of 3′-flanking sequences, plus the sequence of primers (SEQID NOS:305-324) used to PCR amplify this region. The transcription startsite in SEQ ID NO:5 is at position 3,872. Thus, the promoter sequencefor FIE3 is located between position 1 and 3,872. The 5′-flanking and3′-flanking regions contain regulatory DNA sequences that control theexpression of this gene.

Cloning the FIE1 Gene

a. Mapping the position of the FIE1 gene genetically. The fie1 mutationwas initially mapped to position 3 on chromosome 1, between AXR3 (auxinresistant dwarf) and EMB60 (embryo lethal). Next, two sets of F2 plantswith recombination breakpoints in the FIE1 gene region were obtained.One set was between emb60 and fie3 and the other set was between axr3and fie3. These recombinants were used to map the fie3 gene relative tomolecular markers (FIG. 4) that were obtained from an overlapping seriesof YAC and BAC clones from the Arabidopsis Stock Center (Ohio StateUniversity, USA).

b. Mapping the position of the FIE1 gene by complementation experiments.To more precisely localize the FIE1 gene, a series of overlapping cosmidclones (2-9, 6-22, 2-8) that span the FIE1 gene region were analyzed(FIG. 4). Each cosmid clone was tested for its ability to complement thefie1 mutation in transgenic plants. Only cosmid 6-22 complemented thefie1 mutation. The cosmids were analyzed for genes with open readingframes. FIG. 5 shows that a single gene was present on the complementingcosmid (6-22) that was not fully encoded on either of thenon-complementing cosmids (2-9 and 2-8). By RTPCR and 5′-race, the cDNAsequence of this gene and predicted amino acid of its protein wereobtained (SEQ ID NO:1 and SEQ ID NO:2, respectively). Comparison of thepredicted amino acid sequence to those in public data bases revealedsignificant homology to the SET family of Polycomb Group Genes (e.g.,Enhancer of Zeste in Drosophila and Curly Leaf in Arabiopsis). Wecompared the wild-type and fiel mutant sequence in 6-22. The onlydifference is a single base pair change that creates a prematuretranslation stop codon in the 5′-end of the set/polycomb group gene. Thebase pair change is at position 823 (C-→T) on the cDNA sequence shown inSEQ ID NO:1.

SEQ ID NO:6 shows the genomic sequence of the FIE1 SET/polycomb gene,plus approximately 2 Kb of 5′-flanking sequences and approximately 0.7Kb of 3′-flanking sequences. The translation start site is located atposition 2036 of SEQ ID NO:6. Thus, the promoter sequence is locatedbetween position 1 and position 2036. Peptides encoded by reading frame1=SEQ ID NOS:7-111; reading frame 2=SEQ ID NOS:112-200; reading frame3=SEQ ID NOS:201-304.

The above examples are provided to illustrate the invention but not tolimit its scope. Other variants of the invention will be readilyapparent to one of ordinary skill in the art and are encompassed by theappended claims. All publications, patents, and patent applicationscited herein are hereby incorporated by reference.

SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 324 <210> SEQ ID NO: 1<211> LENGTH: 2136 <212> TYPE: DNA <213> ORGANISM: Arabidopsis sp. <220>FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (43)..(2112) <223> OTHERINFORMATION: fertilization-independent endos perm 1 (FIE1) cDNA <400>SEQUENCE: 1 aacatcagag aagacgagaa aaaaagaaga ggcgagtggt ta atg gag aaggaa 54 Met Glu Lys Glu 1 aac cat gag gac gat ggt gag ggt ttg cca cc cgaa cta aat cag ata 102 Asn His Glu Asp Asp Gly Glu Gly Leu Pro Pr o GluLeu Asn Gln Ile 5 10 15 20 aaa gag caa atc gaa aag gag aga ttt ctg ca tatc aag aga aaa ttc 150 Lys Glu Gln Ile Glu Lys Glu Arg Phe Leu Hi s IleLys Arg Lys Phe 25 30 35 gag ctg aga tac att cca agt gtg gct act ca tgct tca cac cat caa 198 Glu Leu Arg Tyr Ile Pro Ser Val Ala Thr Hi s AlaSer His His Gln 40 45 50 tcg ttt gac tta aac cag ccc gct gca gag ga tgat aat gga gga gac 246 Ser Phe Asp Leu Asn Gln Pro Ala Ala Glu As p AspAsn Gly Gly Asp 55 60 65 aac aaa tca ctt ttg tcg aga atg caa aac cc actt cgt cat ttc agt 294 Asn Lys Ser Leu Leu Ser Arg Met Gln Asn Pr o LeuArg His Phe Ser 70 75 80 gcc tca tct gat tat aat tct tac gaa gat ca aggt tat gtt ctt gat 342 Ala Ser Ser Asp Tyr Asn Ser Tyr Glu Asp Gl n GlyTyr Val Leu Asp 85 90 95 100 gag gat caa gat tat gct ctt gaa gaa gat gta cca tta ttt ctt gat 390 Glu Asp Gln Asp Tyr Ala Leu Glu Glu Asp Va lPro Leu Phe Leu Asp 105 110 115 gaa gat gta cca tta tta cca agt gtc aagct t cca att gtt gag aag 438 Glu Asp Val Pro Leu Leu Pro Ser Val Lys Leu Pro Ile Val Glu Lys 120 125 130 cta cca cga tcc att aca tgg gtc ttcacc aa a agt agc cag ctg atg 486 Leu Pro Arg Ser Ile Thr Trp Val Phe ThrLy s Ser Ser Gln Leu Met 135 140 145 gct gaa agt gat tct gtg att ggt aagaga ca a atc tat tat ttg aat 534 Ala Glu Ser Asp Ser Val Ile Gly Lys ArgGl n Ile Tyr Tyr Leu Asn 150 155 160 ggt gag gca cta gaa ttg agc agt gaagaa ga t gag gaa gat gaa gaa 582 Gly Glu Ala Leu Glu Leu Ser Ser Glu GluAs p Glu Glu Asp Glu Glu 165 1 70 1 75 1 80 gaa gat gag gaa gaa atc aagaaa gaa aaa tg c gaa ttt tct gaa gat 630 Glu Asp Glu Glu Glu Ile Lys LysGlu Lys Cy s Glu Phe Ser Glu Asp 185 190 195 gta gac cga ttt ata tgg acggtt ggg cag ga c tat ggt ttg gat gat 678 Val Asp Arg Phe Ile Trp Thr ValGly Gln As p Tyr Gly Leu Asp Asp 200 205 210 ctg gtc gtg cgg cgt gct ctcgcc aag tac ct c gaa gtg gat gtt tcg 726 Leu Val Val Arg Arg Ala Leu AlaLys Tyr Le u Glu Val Asp Val Ser 215 220 225 gac ata ttg gaa aga tac aatgaa ctc aag ct t aag aat gat gga act 774 Asp Ile Leu Glu Arg Tyr Asn GluLeu Lys Le u Lys Asn Asp Gly Thr 230 235 240 gct ggt gag gct tct gat ttgaca tcc aag ac a ata act act gct ttc 822 Ala Gly Glu Ala Ser Asp Leu ThrSer Lys Th r Ile Thr Thr Ala Phe 245 2 50 2 55 2 60 cag gat ttt gct gataga cgt cat tgc cgt cg t tgc atg ata ttc gat 870 Gln Asp Phe Ala Asp ArgArg His Cys Arg Ar g Cys Met Ile Phe Asp 265 270 275 tgt cat atg cat gagaag tat gag ccc gag tc t aga tcc agc gaa gac 918 Cys His Met His Glu LysTyr Glu Pro Glu Se r Arg Ser Ser Glu Asp 280 285 290 aaa tct agt ttg tttgag gat gaa gat aga ca a cca tgc agt gag cat 966 Lys Ser Ser Leu Phe GluAsp Glu Asp Arg Gl n Pro Cys Ser Glu His 295 300 305 tgt tac ctc aag gtgagg agt gtg aca gaa gc t gat cat gtg atg gat 1014 Cys Tyr Leu Lys ValArg Ser Val Thr Glu Al a Asp His Val Met Asp 310 315 320 aat gat aac tctata tca aac aag att gtg gt c tca gat cca aac aac 1062 Asn Asp Asn SerIle Ser Asn Lys Ile Val Va l Ser Asp Pro Asn Asn 325 3 30 3 35 3 40 actatg tgg acg cct gta gag aag gat ctt ta c ttg aaa gga att gag 1110 ThrMet Trp Thr Pro Val Glu Lys Asp Leu Ty r Leu Lys Gly Ile Glu 345 350 355ata ttt ggg aga aac agt tgt gat gtt gca tt a aac ata ctt cgg ggg 1158Ile Phe Gly Arg Asn Ser Cys Asp Val Ala Le u Asn Ile Leu Arg Gly 360 365370 ctt aag acg tgc cta gag att tac aat tac at g cgc gaa caa gat caa1206 Leu Lys Thr Cys Leu Glu Ile Tyr Asn Tyr Me t Arg Glu Gln Asp Gln375 380 385 tgt act atg tca tta gac ctt aac aaa act ac a caa aga cac aatcag 1254 Cys Thr Met Ser Leu Asp Leu Asn Lys Thr Th r Gln Arg His AsnGln 390 395 400 gtt acc aaa aaa gta tct cga aaa agt agt ag g tcg gtc cgcaaa aaa 1302 Val Thr Lys Lys Val Ser Arg Lys Ser Ser Ar g Ser Val ArgLys Lys 405 4 10 4 15 4 20 tcg aga ctc cga aaa tat gct cgt tat ccg cc tgct tta aag aaa aca 1350 Ser Arg Leu Arg Lys Tyr Ala Arg Tyr Pro Pr oAla Leu Lys Lys Thr 425 430 435 act agt gga gaa gct aag ttt tat aag cacta c aca cca tgc act tgc 1398 Thr Ser Gly Glu Ala Lys Phe Tyr Lys His Tyr Thr Pro Cys Thr Cys 440 445 450 aag tca aaa tgt gga cag caa tgc ccttgt tt a act cac gaa aat tgc 1446 Lys Ser Lys Cys Gly Gln Gln Cys ProCys Le u Thr His Glu Asn Cys 455 460 465 tgc gag aaa tat tgc ggg tgc tcaaag gat tg c aac aat cgc ttt gga 1494 Cys Glu Lys Tyr Cys Gly Cys SerLys Asp Cy s Asn Asn Arg Phe Gly 470 475 480 gga tgt aat tgt gca att ggccaa tgc aca aa t cga caa tgt cct tgt 1542 Gly Cys Asn Cys Ala Ile GlyGln Cys Thr As n Arg Gln Cys Pro Cys 485 4 90 4 95 5 00 ttt gct gct aatcgt gaa tgc gat cca gat ct t tgt cgg agt tgt cct 1590 Phe Ala Ala AsnArg Glu Cys Asp Pro Asp Le u Cys Arg Ser Cys Pro 505 510 515 ctt agc tgtgga gat ggc act ctt ggt gag ac a cca gtg caa atc caa 1638 Leu Ser CysGly Asp Gly Thr Leu Gly Glu Th r Pro Val Gln Ile Gln 520 525 530 tgc aagaac atg caa ttc ctc ctt caa acc aa t aaa aag att ctc att 1686 Cys LysAsn Met Gln Phe Leu Leu Gln Thr As n Lys Lys Ile Leu Ile 535 540 545 ggaaag tct gat gtt cat gga tgg ggt gca tt t aca tgg gac tct ctt 1734 GlyLys Ser Asp Val His Gly Trp Gly Ala Ph e Thr Trp Asp Ser Leu 550 555 560aaa aag aat gag tat ctc gga gaa tat act gg a gaa ctg atc act cat 1782Lys Lys Asn Glu Tyr Leu Gly Glu Tyr Thr Gl y Glu Leu Ile Thr His 565 570 5 75 5 80 gat gaa gct aat gag cgt ggg aga ata gaa ga t cgg att ggttct tcc 1830 Asp Glu Ala Asn Glu Arg Gly Arg Ile Glu As p Arg Ile GlySer Ser 585 590 595 tac ctc ttt acc ttg aat gat cag ctc gaa at c gat gctcgc cgt aaa 1878 Tyr Leu Phe Thr Leu Asn Asp Gln Leu Glu Il e Asp AlaArg Arg Lys 600 605 610 gga aac gag ttc aaa ttt ctc aat cac tca gc a agacct aac tgc tac 1926 Gly Asn Glu Phe Lys Phe Leu Asn His Ser Al a ArgPro Asn Cys Tyr 615 620 625 gcc aag ttg atg att gtg aga gga gat cag ag gatt ggt cta ttt gcg 1974 Ala Lys Leu Met Ile Val Arg Gly Asp Gln Ar gIle Gly Leu Phe Ala 630 635 640 gag aga gca atc gaa gaa ggt gag gag ctttt c ttc gac tac tgc tat 2022 Glu Arg Ala Ile Glu Glu Gly Glu Glu Leu Phe Phe Asp Tyr Cys Tyr 645 6 50 6 55 6 60 gga cca gaa cat gcg gat tgg tcgcgt ggt cg a gaa cct aga aag act 2070 Gly Pro Glu His Ala Asp Trp SerArg Gly Ar g Glu Pro Arg Lys Thr 665 670 675 ggt gct tct aaa agg tct aaggaa gcc cgt cc a gct cgt tagtttttga 2119 Gly Ala Ser Lys Arg Ser Lys GluAla Arg Pr o Ala Arg 680 685 tctgaggaga agcagca 2136 <210> SEQ ID NO: 2<211> LENGTH: 689 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 2 Met Glu Lys Glu Asn His Glu Asp Asp Gly Gl u Gly Leu Pro ProGlu 1 5 10 15 Leu Asn Gln Ile Lys Glu Gln Ile Glu Lys Gl u Arg Phe LeuHis Ile 20 25 30 Lys Arg Lys Phe Glu Leu Arg Tyr Ile Pro Se r Val AlaThr His Ala 35 40 45 Ser His His Gln Ser Phe Asp Leu Asn Gln Pr o AlaAla Glu Asp Asp 50 55 60 Asn Gly Gly Asp Asn Lys Ser Leu Leu Ser Ar gMet Gln Asn Pro Leu 65 70 75 80 Arg His Phe Ser Ala Ser Ser Asp Tyr AsnSe r Tyr Glu Asp Gln Gly 85 90 95 Tyr Val Leu Asp Glu Asp Gln Asp TyrAla Le u Glu Glu Asp Val Pro 100 105 110 Leu Phe Leu Asp Glu Asp Val ProLeu Leu Pr o Ser Val Lys Leu Pro 115 120 125 Ile Val Glu Lys Leu Pro ArgSer Ile Thr Tr p Val Phe Thr Lys Ser 130 135 140 Ser Gln Leu Met Ala GluSer Asp Ser Val Il e Gly Lys Arg Gln Ile 145 1 50 1 55 1 60 Tyr Tyr LeuAsn Gly Glu Ala Leu Glu Leu Se r Ser Glu Glu Asp Glu 165 170 175 Glu AspGlu Glu Glu Asp Glu Glu Glu Ile Ly s Lys Glu Lys Cys Glu 180 185 190 PheSer Glu Asp Val Asp Arg Phe Ile Trp Th r Val Gly Gln Asp Tyr 195 200 205Gly Leu Asp Asp Leu Val Val Arg Arg Ala Le u Ala Lys Tyr Leu Glu 210 215220 Val Asp Val Ser Asp Ile Leu Glu Arg Tyr As n Glu Leu Lys Leu Lys 2252 30 2 35 2 40 Asn Asp Gly Thr Ala Gly Glu Ala Ser Asp Le u Thr Ser LysThr Ile 245 250 255 Thr Thr Ala Phe Gln Asp Phe Ala Asp Arg Ar g His CysArg Arg Cys 260 265 270 Met Ile Phe Asp Cys His Met His Glu Lys Ty r GluPro Glu Ser Arg 275 280 285 Ser Ser Glu Asp Lys Ser Ser Leu Phe Glu As pGlu Asp Arg Gln Pro 290 295 300 Cys Ser Glu His Cys Tyr Leu Lys Val ArgSe r Val Thr Glu Ala Asp 305 3 10 3 15 3 20 His Val Met Asp Asn Asp AsnSer Ile Ser As n Lys Ile Val Val Ser 325 330 335 Asp Pro Asn Asn Thr MetTrp Thr Pro Val Gl u Lys Asp Leu Tyr Leu 340 345 350 Lys Gly Ile Glu IlePhe Gly Arg Asn Ser Cy s Asp Val Ala Leu Asn 355 360 365 Ile Leu Arg GlyLeu Lys Thr Cys Leu Glu Il e Tyr Asn Tyr Met Arg 370 375 380 Glu Gln AspGln Cys Thr Met Ser Leu Asp Le u Asn Lys Thr Thr Gln 385 3 90 3 95 4 00Arg His Asn Gln Val Thr Lys Lys Val Ser Ar g Lys Ser Ser Arg Ser 405 410415 Val Arg Lys Lys Ser Arg Leu Arg Lys Tyr Al a Arg Tyr Pro Pro Ala 420425 430 Leu Lys Lys Thr Thr Ser Gly Glu Ala Lys Ph e Tyr Lys His Tyr Thr435 440 445 Pro Cys Thr Cys Lys Ser Lys Cys Gly Gln Gl n Cys Pro Cys LeuThr 450 455 460 His Glu Asn Cys Cys Glu Lys Tyr Cys Gly Cy s Ser Lys AspCys Asn 465 4 70 4 75 4 80 Asn Arg Phe Gly Gly Cys Asn Cys Ala Ile Gl yGln Cys Thr Asn Arg 485 490 495 Gln Cys Pro Cys Phe Ala Ala Asn Arg GluCy s Asp Pro Asp Leu Cys 500 505 510 Arg Ser Cys Pro Leu Ser Cys Gly AspGly Th r Leu Gly Glu Thr Pro 515 520 525 Val Gln Ile Gln Cys Lys Asn MetGln Phe Le u Leu Gln Thr Asn Lys 530 535 540 Lys Ile Leu Ile Gly Lys SerAsp Val His Gl y Trp Gly Ala Phe Thr 545 5 50 5 55 5 60 Trp Asp Ser LeuLys Lys Asn Glu Tyr Leu Gl y Glu Tyr Thr Gly Glu 565 570 575 Leu Ile ThrHis Asp Glu Ala Asn Glu Arg Gl y Arg Ile Glu Asp Arg 580 585 590 Ile GlySer Ser Tyr Leu Phe Thr Leu Asn As p Gln Leu Glu Ile Asp 595 600 605 AlaArg Arg Lys Gly Asn Glu Phe Lys Phe Le u Asn His Ser Ala Arg 610 615 620Pro Asn Cys Tyr Ala Lys Leu Met Ile Val Ar g Gly Asp Gln Arg Ile 625 630 6 35 6 40 Gly Leu Phe Ala Glu Arg Ala Ile Glu Glu Gl y Glu Glu LeuPhe Phe 645 650 655 Asp Tyr Cys Tyr Gly Pro Glu His Ala Asp Tr p Ser ArgGly Arg Glu 660 665 670 Pro Arg Lys Thr Gly Ala Ser Lys Arg Ser Ly s GluAla Arg Pro Ala 675 680 685 Arg <210> SEQ ID NO: 3 <211> LENGTH: 1563<212> TYPE: DNA <213> ORGANISM: Arabidopsis sp. <220> FEATURE: <221>NAME/KEY: CDS <222> LOCATION: (199)..(1308) <223> OTHER INFORMATION:fertilization-independent endos perm 3 (FIE3) cDNA <400> SEQUENCE: 3aaaggtgagt tgtgtgttgt gtcaggtcca aaataaaagt ttgtcgtgag gt caaaatct 60acggttacag taattttaat aacctgtgaa tctgtgtcta atcgaaaatt ac aaaacacc 120agttgttgtt gcatgagaga cttgtgagct tagattagtg tgcgagagtc ag acagagag 180agagatttcg aatatcga atg tcg aag ata acc tta ggg aac gag tca ata 231 MetSer Lys Ile Thr Leu Gly Asn Glu Ser Ile 1 5 10 gtt ggg tct ttg act ccatcg aat aag aaa tc g tac aaa gtg acg aat 279 Val Gly Ser Leu Thr Pro SerAsn Lys Lys Se r Tyr Lys Val Thr Asn 15 20 25 agg att cag gaa ggg aagaaa cct ttg tat gc t gtt gtt ttc aac ttc 327 Arg Ile Gln Glu Gly Lys LysPro Leu Tyr Al a Val Val Phe Asn Phe 30 35 40 ctt gat gct cgt ttc ttcgat gtc ttc gtt ac c gct ggt gga aat cgg 375 Leu Asp Ala Arg Phe Phe AspVal Phe Val Th r Ala Gly Gly Asn Arg 45 50 55 att act ctg tac aat tgtctc gga gat ggt gc c ata tca gca ttg caa 423 Ile Thr Leu Tyr Asn Cys LeuGly Asp Gly Al a Ile Ser Ala Leu Gln 60 65 70 75 tcc tat gct gat gaa gataag gaa gag tcg tt t tac acg gta agt tgg 471 Ser Tyr Ala Asp Glu Asp LysGlu Glu Ser Ph e Tyr Thr Val Ser Trp 80 85 90 gcg tgt ggc gtt aat gggaac cca tat gtt gc g gct gga gga gta aaa 519 Ala Cys Gly Val Asn Gly AsnPro Tyr Val Al a Ala Gly Gly Val Lys 95 100 105 ggt ata atc cga gtc attgac gtc aac agt ga a acg att cat aag agt 567 Gly Ile Ile Arg Val Ile AspVal Asn Ser Gl u Thr Ile His Lys Ser 110 115 120 ctt gtg ggt cat gga gattca gtg aac gaa at c agg aca caa cct tta 615 Leu Val Gly His Gly Asp SerVal Asn Glu Il e Arg Thr Gln Pro Leu 125 130 135 aaa cct caa ctt gtg attact gct agc aag ga t gaa tct gtt cgt ttg 663 Lys Pro Gln Leu Val Ile ThrAla Ser Lys As p Glu Ser Val Arg Leu 140 1 45 1 50 1 55 tgg aat gtt gaaact ggg ata tgt att ttg at a ttt gct gga gct gga 711 Trp Asn Val Glu ThrGly Ile Cys Ile Leu Il e Phe Ala Gly Ala Gly 160 165 170 ggt cat cgc tatgaa gtt cta agt gtg gat tt t cat ccg tct gat att 759 Gly His Arg Tyr GluVal Leu Ser Val Asp Ph e His Pro Ser Asp Ile 175 180 185 tac cgc ttt gctagt tgt ggt atg gac acc ac t att aaa ata tgg tca 807 Tyr Arg Phe Ala SerCys Gly Met Asp Thr Th r Ile Lys Ile Trp Ser 190 195 200 atg aaa gag ttttgg acg tac gtc gag aag tc a ttc aca tgg act gat 855 Met Lys Glu Phe TrpThr Tyr Val Glu Lys Se r Phe Thr Trp Thr Asp 205 210 215 gat cca tca aaattc ccc aca aaa ttt gtc ca a ttc cct gta ttt aca 903 Asp Pro Ser Lys PhePro Thr Lys Phe Val Gl n Phe Pro Val Phe Thr 220 2 25 2 30 2 35 gct tccatt cat aca aat tat gta gat tgt aa c cgt tgg ttt ggt gat 951 Ala Ser IleHis Thr Asn Tyr Val Asp Cys As n Arg Trp Phe Gly Asp 240 245 250 ttt atcctc tca aag agt gtg gac aac gag at c ctg ttg tgg gaa cca 999 Phe Ile LeuSer Lys Ser Val Asp Asn Glu Il e Leu Leu Trp Glu Pro 255 260 265 caa ctgaaa gag aat tct cct ggc gag gga gc t tca gat gtt cta tta 1047 Gln LeuLys Glu Asn Ser Pro Gly Glu Gly Al a Ser Asp Val Leu Leu 270 275 280 agatac ccg gtt cca atg tgt gat att tgg tt t atc aag ttt tct tgt 1095 ArgTyr Pro Val Pro Met Cys Asp Ile Trp Ph e Ile Lys Phe Ser Cys 285 290 295gac ctc cat tta agt tct gtt gcg ata ggt aa t cag gaa gga aag gtt 1143Asp Leu His Leu Ser Ser Val Ala Ile Gly As n Gln Glu Gly Lys Val 300 305 3 10 3 15 tat gtc tgg gat ttg aaa agt tgc cct cct gt t ttg att acaaag tta 1191 Tyr Val Trp Asp Leu Lys Ser Cys Pro Pro Va l Leu Ile ThrLys Leu 320 325 330 tca cac aat caa tca aag tct gta atc agg ca a aca gccatg tct gtc 1239 Ser His Asn Gln Ser Lys Ser Val Ile Arg Gl n Thr AlaMet Ser Val 335 340 345 gat gga agc acg att ctt gct tgc tgc gag ga c gggact ata tgg cgc 1287 Asp Gly Ser Thr Ile Leu Ala Cys Cys Glu As p GlyThr Ile Trp Arg 350 355 360 tgg gac gtg att acc aag tagcggtctgagtcttgtag ga attgatga 1335 Trp Asp Val Ile Thr Lys 365 attaggagtgcgaagaaatg agatatccat tcttttattg taattctgat ca tgttgcta 1395 ctccctgagaccttgagatg ctctttgtag ccttgttaac gtccaccctt gt accacagt 1455 gtataccctttctggagatt ttgtcttatt ctcttagttc aatacacaag gc tgtatcct 1515 ggagctttattgcaggaacc actctctttc ataagctttc tagtattc 1563 <210> SEQ ID NO: 4 <211>LENGTH: 369 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 4 Met Ser Lys Ile Thr Leu Gly Asn Glu Ser Il e Val Gly Ser LeuThr 1 5 10 15 Pro Ser Asn Lys Lys Ser Tyr Lys Val Thr As n Arg Ile GlnGlu Gly 20 25 30 Lys Lys Pro Leu Tyr Ala Val Val Phe Asn Ph e Leu AspAla Arg Phe 35 40 45 Phe Asp Val Phe Val Thr Ala Gly Gly Asn Ar g IleThr Leu Tyr Asn 50 55 60 Cys Leu Gly Asp Gly Ala Ile Ser Ala Leu Gl nSer Tyr Ala Asp Glu 65 70 75 80 Asp Lys Glu Glu Ser Phe Tyr Thr Val SerTr p Ala Cys Gly Val Asn 85 90 95 Gly Asn Pro Tyr Val Ala Ala Gly GlyVal Ly s Gly Ile Ile Arg Val 100 105 110 Ile Asp Val Asn Ser Glu Thr IleHis Lys Se r Leu Val Gly His Gly 115 120 125 Asp Ser Val Asn Glu Ile ArgThr Gln Pro Le u Lys Pro Gln Leu Val 130 135 140 Ile Thr Ala Ser Lys AspGlu Ser Val Arg Le u Trp Asn Val Glu Thr 145 1 50 1 55 1 60 Gly Ile CysIle Leu Ile Phe Ala Gly Ala Gl y Gly His Arg Tyr Glu 165 170 175 Val LeuSer Val Asp Phe His Pro Ser Asp Il e Tyr Arg Phe Ala Ser 180 185 190 CysGly Met Asp Thr Thr Ile Lys Ile Trp Se r Met Lys Glu Phe Trp 195 200 205Thr Tyr Val Glu Lys Ser Phe Thr Trp Thr As p Asp Pro Ser Lys Phe 210 215220 Pro Thr Lys Phe Val Gln Phe Pro Val Phe Th r Ala Ser Ile His Thr 2252 30 2 35 2 40 Asn Tyr Val Asp Cys Asn Arg Trp Phe Gly As p Phe Ile LeuSer Lys 245 250 255 Ser Val Asp Asn Glu Ile Leu Leu Trp Glu Pr o Gln LeuLys Glu Asn 260 265 270 Ser Pro Gly Glu Gly Ala Ser Asp Val Leu Le u ArgTyr Pro Val Pro 275 280 285 Met Cys Asp Ile Trp Phe Ile Lys Phe Ser Cy sAsp Leu His Leu Ser 290 295 300 Ser Val Ala Ile Gly Asn Gln Glu Gly LysVa l Tyr Val Trp Asp Leu 305 3 10 3 15 3 20 Lys Ser Cys Pro Pro Val LeuIle Thr Lys Le u Ser His Asn Gln Ser 325 330 335 Lys Ser Val Ile Arg GlnThr Ala Met Ser Va l Asp Gly Ser Thr Ile 340 345 350 Leu Ala Cys Cys GluAsp Gly Thr Ile Trp Ar g Trp Asp Val Ile Thr 355 360 365 Lys <210> SEQID NO: 5 <211> LENGTH: 5801 <212> TYPE: DNA <213> ORGANISM: Arabidopsissp. <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (3872)..(5566)<223> OTHER INFORMATION: fertilization-independent endos perm 3 (FIE3)WD40/polycomb gene genomic sequence <400> SEQUENCE: 5 tctgaagcagctaatcgatc cactaatctt gtggagatcg tgtgttgctt tg gtgcatat 60 atatacaaatagacaaatac atatgcgttt acatatatat gtaagcacgt at ttagagag 120 caacaataaggcatgagaaa tgtgattatc gtcaaatcat gattgctaca tg acaaatcg 180 atcttaattttgaaaaagag acatttaaat attcaaaaaa cggtaaaaat tt ctttaaga 240 ccaaccatggaaataacatg agaagactga gagggagatt agaacttaca ac aagagaat 300 ctttttccttcaatattttt tttaaacact tttcttttgt agggaatttg at aatatgaa 360 atggatagattttactgctt aatttttaat cattttttat cagaaacttt tt cgttttaa 420 atctacggctagaattttcg gtcggtttta tactttatat agatgctaga tt tttttctt 480 ctagtcatcgtttattagta caattttgtt tttatatatt gattacttga at ttataata 540 ggattggtacaaaggtggta attataaagt gcattttttt ggatattgtt ca attcaaat 600 atttttacttagattctcaa actattgaaa aatatccaaa atatccggaa aa tttcaatt 660 taatcgaataaaaaaattag aatggaaaga ataaaaaatt atcgggtaca at tagaagag 720 taatgtgtttagtttggttt ttactcggat accagttcag ttttcacgta tt attcgatc 780 ctataggagcaattgtgaat tagttgtgag attttgggag cattcgcttc ca gaacttag 840 tgctaggagaaatgctattt tcctataaga gttgtacgag gaagcgagca ag tacacaac 900 aaccacaaaagctttcaata cttgtttact cctagggttt aaaactagag gt tctataga 960 tctctaaatttttttgaaca aatgtgtttt ccacacgtga tattctacaa ta ccactcga 1020 aaattatccataattgcttt aaactatttt tttgtttaaa ttatataatt tg taccgttg 1080 taaactgattatttcaaatt ataattaaag cactataatt tcatatatta ca ttcaacat 1140 atattaaaataaactataac catgtatttt tttgtcttcc tttcctataa ac attgattg 1200 gactctatcgtaaattttgt cgttatcgca aattttgtcg ttatcgatga gt ttctcaaa 1260 gtttggaccttgattatctt gtttggagat gttcaaatcg ttatatccaa at agtgaact 1320 tctaattttcttttttgata atgtgactta tttggaaaag tattccaaag ta ttcaaata 1380 aaccctttaaaaatccatta aatacatttt aaataagtaa aatgctctca ac gaagagat 1440 atcatggtaaataacaacag tgagaggata aaatgttaaa tcaatttatt ta caacttca 1500 aataggcggacatcaaacct acttagcaca ctttctattt tcaaattggt ta tggtttgt 1560 ctattagttgttgcatctat gttttttaat tcttatatcg gtgatcttga tt ttgttttg 1620 gtgtatctaaaatctatttt agttaaagtg caagaaaata aaataaaaac tt aaggtaag 1680 agatgaaagtaagctttaaa taaaacagag cacttctatg gtcgattata ga gccaagtt 1740 cgttcctccattttggctta atgcaatatt acaagtaaat cttataaaac tt tccataag 1800 tatcgtattacccatggata ctatgatata taaactctcg gaggtgtagt cc agaagaaa 1860 tgatccatatttgcatacag taaacttgat ggaaaaaata tgtggtactg tt ggaattgt 1920 agctattgagtatcaaattt gagaaaaagg taaaaaaata tgtaaaattt gg gtggaaga 1980 aaagaattacataaaattga gaaatgtatg taattgacaa aataatgttt tc aaaacata 2040 aaaacgtgataccatttaaa tccaaacctt atatcattta accattttta gt aaaactaa 2100 tagtaatgaatggtcaataa tataagatta catattaaat aattactact tt cagaaaat 2160 ttcaatcaaatctataatat tcctttgaaa aaaaagaaag acaaataggt aa acttcgat 2220 cgtatcaatcaaagaatata tttatttttc atcgtaacgt ttaattctaa gt cctattaa 2280 aaaacgttaaatttgatttt tcttaccatt tttttctaaa aggtgagttg tg tgttgtgt 2340 caggtccaaaataaaagttt gtcgtgaggt caaaatctac ggttacagta at tttaataa 2400 cctgtgaatctgtgtctaat cgaaaattac aaaacaccag ttgttgttgc at gagagact 2460 tgtgagcttagattagtgtg cgagagtcag acagagagag agatttcgaa ta tcgaatgt 2520 cgaagataaccttagggaac gagtcaatag ttgggtcttt gactccatcg aa taagaaat 2580 cgtacaaagtgacgaatagg attcaggaag ggaagaaacc tttgtatgct gt tgttttca 2640 acttccttgatgctcgtttc ttcgatgtct tcgttaccgc tggtggaaat cg ggtaaaag 2700 atctcgactttcaattcgaa atcactgttt tcaattctgg gtctgtttag gt tttgattc 2760 agattgattgtaacattaag gcctttcctt ttgtgtttga ttttggattc tg atttctag 2820 cctttagtgagattaaaaga ttgaaacttt gcttgatgct atagtctaag at tatgtaac 2880 atttagttcaaactttctgg ttttggagat tttgtggaag atatggtttt tg ttttctaa 2940 tttaaagtgaactcattacc ttatacactt gatttgcatt ctgttctaaa aa aaattgaa 3000 actttggttgatgttgttag tctgcttatc taaggaggtt ccttttgaaa cg gtcatcaa 3060 gtgagttatgaagcgtttag tttaagcttt cctgtattgg agattttgtg ga agttattt 3120 ttttttctaattttgaaact agatagagtg aagtcattac cttatacatt ag actgctct 3180 attttgttttcaatgtgggt tccgaatgta cctgatagtg gctctttagg ct catttgta 3240 ttcgtcgaaacatcgatcgg atacccgttt gggcttagta ggctctgata cc gcgtaaag 3300 ttctcgggttccatgaaaaa ccaatcggta atgagtggag ttaatttgta at cgtcttcg 3360 gtcgagcatttgggattagt gggctttgat accatgtgaa agtccttggg gt ccaatcgg 3420 caatgagtagagttaacttg taatcttaca cacttggtta ggtctcattc tc tttataat 3480 gttgtgtgcctaacagtttc cgcactaagg ttgtttggtt gctcagtctc aa tatactta 3540 tcttaactagttgtagtttt tttcatcttt cctagtttcc gttggatttt aa attgaatg 3600 atttactagttagaaatatt tgagtttctc atagaagctt taaccaaggg gt tctttcat 3660 ttaacctttacttagctagt tcatgaatct cattactgcc attggtgtat ct cttattat 3720 gtagattactctgtacaatt gtctcggaga tggtgccata tcagcattgc aa tcctatgc 3780 tgatgaagatgtaaggaagc atacatatta gcttttccat caaattaaag ta agtgatgt 3840 ttcactgaggccatttggtt atattttgtc tatgtcctct ggagagcaga ag gaagagtc 3900 gttttacacggtaagttggg cgtgtggcgt taatgggaac ccatatgttg cg gctggagg 3960 agtaaaaggtataatccgag tcattgacgt caacagtgaa acgattcata ag gtattatt 4020 gcatttttatggatgttcta tgtatcctag caaatgattc tatatctttc tt gtataatc 4080 tgtgctcgcaaatgtgcaga gtcttgtggg tcatggagat tcagtgaacg aa atcaggac 4140 acaacctttaaaacctcaac ttgtgattac tgctagcaag gtatatctct tg gctttctt 4200 ttcttcctaaagtatcctga cttctttttt atttgttggt gattaagagc tg ttacgttt 4260 taattgaataaggatgaatc tgttcgtttg tggaatgttg aaactgggat at gtattttg 4320 atatttgctggagctggagg tcatcgctat gaagttctaa gtgtggtgag cc aatattgt 4380 tttatctaattcagttagtt ttctacaata atatatagag acaatgttaa gg ggaaccat 4440 cttattttgaaaattgtagg attttcatcc gtctgatatt taccgctttg ct agttgtgg 4500 tatggacaccactattaaaa tatggtcaat gaaaggtacg atcgagcaca ta ttgtaata 4560 aacttccattttaaaaaacc ttttgagaaa aatggcttgt ggttcgtttg ta tgatcttc 4620 ttattctttggctgtctata gagttttgga cgtacgtcga gaagtcattc ac atggactg 4680 atgatccatcaaaattcccc acaaaatttg tccaattccc tgtaagtatt tt gttttagc 4740 cttgtcttgtaacaacaagt gacatacaaa tattggtgat ggcctttgta aa taacatta 4800 cttctatatgtaggtattta cagcttccat tcatacaaat tatgtagatt gt aaccgttg 4860 gtttggtgattttatcctct caaaggttag taagtcaatg atggttaaga tt aattcatt 4920 tggtgtactgttaaaacact ttactcttgt gttgttctat cggattttag ag tgtggaca 4980 acgagatcctgttgtgggaa ccacaactga aagagaattc tcctggcgag gt taggatct 5040 cattgttgctccaaacacaa cataatcatt catttcatca catatattta ca gttgaact 5100 ttttgtggtttgcagggagc ttcagatgtt ctattaagat acccggttcc aa tgtgtgat 5160 atttggtttatcaagttttc ttgtgacctc catttaagtt ctgttgcgat ag gtaatcag 5220 agagctcgttagatacaaat ttgcattcta tagatagatt acttcaactt tt cttattca 5280 ttttgtgacaaattactcgc tggtttgtta tcaggtaatc aggaaggaaa gg tttatgtc 5340 tgggatttgaaaagttgccc tcctgttttg attacaaagt aagttagttt cg gattcaga 5400 tacaatgtttgatctttaag aaatgtttta gtcttgacat gattttctgt tg ccatatag 5460 gttatcacacaatcaatcaa agtctgtaat caggcaaaca gccatgtctg tc gatggaag 5520 gtataaatccatcttctctc tcaccaatgc agtgaaaatt tcttaatgtt at ttatgact 5580 caatagttactgtaaatcaa accaaacttt ggattctgac acactgtttc tt ccatggga 5640 ttgtagcacgattcttgctt gctgcgagga cgggactata tggcgctggg ac gtgattac 5700 caagtagcggtctgagtctt gtaggaattg atgaattagg agtgcgaaga aa tgagatat 5760 ccattcttttattgtaattc tgatcatgtt gctactccct g 5801 <210> SEQ ID NO: 6 <211> LENGTH:7015 <212> TYPE: DNA <213> ORGANISM: Arabidopsis sp. <220> FEATURE:<221> NAME/KEY: CDS <222> LOCATION: (1)..(7014) <223> OTHER INFORMATION:fertilization-independent endos perm 1 (FIE1) SET/polycomb gene genomicsequence r eading frame 1 <220> FEATURE: <221> NAME/KEY: CDS <222>LOCATION: (2)..(7015) <223> OTHER INFORMATION: fertilization-independentendos perm 1 (FIE1) SET/polycomb gene genomic sequence r eading frame 2<220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (3)..(7013) <223>OTHER INFORMATION: fertilization-independent endos perm 1 (FIE1)SET/polycomb gene genomic sequence r eading frame 3 <400> SEQUENCE: 6gga tcc att att ttt aaa aat caa att ttt tc a tat cta tta ttt gtt 48 GlySer Ile Ile Phe Lys Asn Gln Ile Phe Se r Tyr Leu Leu Phe Val 1 5 10 15tca aag aaa aaa aaa aca cac gac gat tat cc a tct gcc ggc tgt gtt 96 SerLys Lys Lys Lys Thr His Asp Asp Tyr Pr o Ser Ala Gly Cys Val 20 25 30cat cgg taa acc tat att tta aaa ctg gtg gg c ttt tca tta cca taa 144 HisArg Thr Tyr Ile Leu Lys Leu Val Gly Phe Ser Leu Pro 35 40 45 gtt tgg acatgt ttt tat aat ttg atg tat ag t gta gac caa aaa ata 192 Val Trp Thr CysPhe Tyr Asn Leu Met Tyr Se r Val Asp Gln Lys Ile 50 55 60 gag aaa taagaa agg gaa cct ttg tgg tga tt g taa caa aac aga aat 240 Glu Lys Glu ArgGlu Pro Leu Trp Leu Gln Asn Arg Asn 65 70 75 80 cat tat att gaa tca ttcgaa aag acg aaa ag a tca aac ctt tgt agc 288 His Tyr Ile Glu Ser Phe GluLys Thr Lys Ar g Ser Asn Leu Cys Ser 85 90 95 tag atg acc ata gac gtggct gcc aat tac ag t ctt aat gct ttt ata 336 Met Thr Ile Asp Val Ala AlaAsn Tyr Ser Leu Asn Ala Phe Ile 100 105 110 tag atc ttt ctt aca tcc tctgtt cct tca ca t tca aga aac agt atc 384 Ile Phe Leu Thr Ser Ser Val ProSer His Ser Arg Asn Ser Ile 115 120 125 atc cca ttt tct ttc ttc ttc tcagtg ttt ca a tct ttg cga att aag 432 Ile Pro Phe Ser Phe Phe Phe Ser ValPhe Gl n Ser Leu Arg Ile Lys 130 135 140 atg gaa cat gaa gaa aca caa aagaac aca ag a aac agc tgg tcc ctg 480 Met Glu His Glu Glu Thr Gln Lys AsnThr Ar g Asn Ser Trp Ser Leu 145 1 50 1 55 1 60 att cga cca ttt caa atgatc tcc att agc tt t ctt agc ctc ctc ctc 528 Ile Arg Pro Phe Gln Met IleSer Ile Ser Ph e Leu Ser Leu Leu Leu 165 170 175 cct cta tct ttc ctc tttctt tca cgt ctc tc t ctc tat acc tcc tca 576 Pro Leu Ser Phe Leu Phe LeuSer Arg Leu Se r Leu Tyr Thr Ser Ser 180 185 190 act ccg gtc acc gtc tccggc gtt tcc tct gt t att cac cag gca gat 624 Thr Pro Val Thr Val Ser GlyVal Ser Ser Va l Ile His Gln Ala Asp 195 200 205 gtc gga gtc tta tac acgatc ttg ttt ctc at c atc gtc ttc act tta 672 Val Gly Val Leu Tyr Thr IleLeu Phe Leu Il e Ile Val Phe Thr Leu 210 215 220 atc cac agt ctc tca ggaaaa cca gaa tgc tc t gtt ctc cat tcc cat 720 Ile His Ser Leu Ser Gly LysPro Glu Cys Se r Val Leu His Ser His 225 2 30 2 35 2 40 ctc tac atc tgctgg atc gtt ctc ttc atc gc c caa gct tgt gcc ttt 768 Leu Tyr Ile Cys TrpIle Val Leu Phe Ile Al a Gln Ala Cys Ala Phe 245 250 255 ggg atc aaa agaacc atg agc acg acc atg tc t ata aat cca gac aaa 816 Gly Ile Lys Arg ThrMet Ser Thr Thr Met Se r Ile Asn Pro Asp Lys 260 265 270 aac ttg ttt cttgcg aca cat gaa aga tgg at g ttg gtt agg gtt ttg 864 Asn Leu Phe Leu AlaThr His Glu Arg Trp Me t Leu Val Arg Val Leu 275 280 285 ttc ttt ttg gggcta cac gaa gtg atg ctg at g tgg ttt aga gtc gtg 912 Phe Phe Leu Gly LeuHis Glu Val Met Leu Me t Trp Phe Arg Val Val 290 295 300 gtt aag cct gtggtt gac aac act ata tat gg g gtc tac gtg gag gag 960 Val Lys Pro Val ValAsp Asn Thr Ile Tyr Gl y Val Tyr Val Glu Glu 305 3 10 3 15 3 20 agg tggtcc gag aga gcc gtt gtg gca gtg ac c ttt ggt ata atg tgg 1008 Arg TrpSer Glu Arg Ala Val Val Ala Val Th r Phe Gly Ile Met Trp 325 330 335 tggtgg agg cta aga gat gag gta gaa agt ct t gtg gtg gtg gtt acg 1056 TrpTrp Arg Leu Arg Asp Glu Val Glu Ser Le u Val Val Val Val Thr 340 345 350gcg gat aga ctt aac ctc ccc att cgt ttg ga g ggt ctc aat ttt gtg 1104Ala Asp Arg Leu Asn Leu Pro Ile Arg Leu Gl u Gly Leu Asn Phe Val 355 360365 aac tgg tgt atg tat tac atc tgt gtt gga at t ggt tta atg aag atc1152 Asn Trp Cys Met Tyr Tyr Ile Cys Val Gly Il e Gly Leu Met Lys Ile370 375 380 ttc aaa ggg ttt ttg gat ttt gtg aat acg tt g act ttg agc attaag 1200 Phe Lys Gly Phe Leu Asp Phe Val Asn Thr Le u Thr Leu Ser IleLys 385 3 90 3 95 4 00 agg tcg aga aaa ggc tgt gaa tca tgt gtt tt t gatgat atg tgt aat 1248 Arg Ser Arg Lys Gly Cys Glu Ser Cys Val Ph e AspAsp Met Cys Asn 405 410 415 gat gat cat gtg taa gat att tga cat att at actc atc tct tga atg 1296 Asp Asp His Val Asp Ile His Ile Ile Leu Ile SerMet 420 425 430 ttt ttg aga ttt ttt tat ttt tat ttt cta tt t ctt gct aggaat tta 1344 Phe Leu Arg Phe Phe Tyr Phe Tyr Phe Leu Ph e Leu Ala ArgAsn Leu 435 440 445 acc cgt ata tat gtc aca aaa ata gta gaa ta t cag aaagca aaa ata 1392 Thr Arg Ile Tyr Val Thr Lys Ile Val Glu Ty r Gln LysAla Lys Ile 450 455 460 ttt tat cta aaa ata acc att gaa cat taa tt t aagtct ttt tat aat 1440 Phe Tyr Leu Lys Ile Thr Ile Glu His Phe Lys Ser PheTyr Asn 465 4 70 4 75 4 80 tat att ttt ata aca cac cct ttt taa gaa aa actt gga gat tta att 1488 Tyr Ile Phe Ile Thr His Pro Phe Glu Lys Leu GlyAsp Leu Ile 485 490 495 aac gtt ata aat agt aaa aaa tat cgg att ta c gtagaa gtt tta aat 1536 Asn Val Ile Asn Ser Lys Lys Tyr Arg Ile Ty r ValGlu Val Leu Asn 500 505 510 gcg tat aat taa att tac gaa ttg aat aat at agcc ata tat ata ttt 1584 Ala Tyr Asn Ile Tyr Glu Leu Asn Asn Ile Ala IleTyr Ile Phe 515 520 525 ttg aag att taa act cat ttt gtt tct tcc at a tatgca taa tat ata 1632 Leu Lys Ile Thr His Phe Val Ser Ser Ile Tyr Ala TyrIle 530 535 540 agc tta aat aga aaa cta gct agg aat gaa ta c taa tat atataa tga 1680 Ser Leu Asn Arg Lys Leu Ala Arg Asn Glu Ty r Tyr Ile 545 550 5 55 5 60 cat taa tat aag tct tac cgg aca ctc caa aa t gta tat attgat cta 1728 His Tyr Lys Ser Tyr Arg Thr Leu Gln Asn Val Tyr Ile Asp Leu565 570 575 tca aca ttt ttt cat tgg ttt act aaa cca ag t tgt cac ata aatatg 1776 Ser Thr Phe Phe His Trp Phe Thr Lys Pro Se r Cys His Ile AsnMet 580 585 590 agt taa cgc ctt ttt ttt tat aat att gta ta t gaa ttt aaactt gag 1824 Ser Arg Leu Phe Phe Tyr Asn Ile Val Tyr Glu Phe Lys Leu Glu595 600 605 ctg tca aac gtc aag caa acc caa cat cta ca t aca tat agt actata 1872 Leu Ser Asn Val Lys Gln Thr Gln His Leu Hi s Thr Tyr Ser ThrIle 610 615 620 ttt tga aaa tta aaa ttt tct taa att tcc ca t att att ttcctt tta 1920 Phe Lys Leu Lys Phe Ser Ile Ser His Ile Ile Phe Leu Leu 6256 30 6 35 6 40 aag caa gca agt cca aat acg ttt ctt cca ga t tat aat tttcct taa 1968 Lys Gln Ala Ser Pro Asn Thr Phe Leu Pro As p Tyr Asn PhePro 645 650 655 taa ggt ttt cta caa aaa aaa atc aac ttc tt a ttt aaa aaaccc ttt 2016 Gly Phe Leu Gln Lys Lys Ile Asn Phe Leu Phe Lys Lys Pro Phe660 665 670 gca tta tcc ttt tca cca aca tca gag aag ac g aga aaa aaa gaagag 2064 Ala Leu Ser Phe Ser Pro Thr Ser Glu Lys Th r Arg Lys Lys GluGlu 675 680 685 gcg agt ggt taa tgg aga agg tta gtt tca ct c caa aca tatatg aat 2112 Ala Ser Gly Trp Arg Arg Leu Val Ser Leu Gln Thr Tyr Met Asn690 695 700 tga cta ggt tat gaa atc cat ata ttt taa tt g tgt gtt tat gataga 2160 Leu Gly Tyr Glu Ile His Ile Phe Leu Cys Val Tyr Asp Arg 705 710 7 15 7 20 tca ata aca ttt agg gtt gaa ttt tct tgt ga t cta tta tgttat tcg 2208 Ser Ile Thr Phe Arg Val Glu Phe Ser Cys As p Leu Leu CysTyr Ser 725 730 735 tcc cat gca tga tcc ata aaa ctt tta ttt tt g aat ttgtct agg aaa 2256 Ser His Ala Ser Ile Lys Leu Leu Phe Leu Asn Leu Ser ArgLys 740 745 750 acc atg agg acg atg gtg agg gtt tgc cac cc g aac taa atcaga taa 2304 Thr Met Arg Thr Met Val Arg Val Cys His Pr o Asn Ile Arg755 760 765 aag agc aaa tcg aaa agg aga gat ttc tgc at a tca agg taa gagaca 2352 Lys Ser Lys Ser Lys Arg Arg Asp Phe Cys Il e Ser Arg Glu Thr770 775 780 ttt ggt tgc ttt aat att tta ttc tct tct gt a tgt ttt tct gaaaat 2400 Phe Gly Cys Phe Asn Ile Leu Phe Ser Ser Va l Cys Phe Ser GluAsn 785 7 90 7 95 8 00 taa gga gag gag agg act taa tct cat aac ta t acgatt cca aag aga 2448 Gly Glu Glu Arg Thr Ser His Asn Tyr Thr Ile Pro LysArg 805 810 815 tgt taa gat aca tct aat aaa cag tta tac at t agt cat aatctt taa 2496 Cys Asp Thr Ser Asn Lys Gln Leu Tyr Ile Ser His Asn Leu 820825 830 aac taa aaa gag aaa ttt cca aac ttt taa at t aaa aac aga att tag2544 Asn Lys Glu Lys Phe Pro Asn Phe Ile Lys Asn Arg Ile 835 840 845 aaaatg cca gcg aat cga taa cga cat cca ga t ctg tcg ggt atc caa 2592 LysMet Pro Ala Asn Arg Arg His Pro Asp Leu Ser Gly Ile Gln 850 855 860 aactta gaa taa aaa aat aat taa tat att ta t aat ata aag ctg gaa 2640 AsnLeu Glu Lys Asn Asn Tyr Ile Tyr Asn Ile Lys Leu Glu 865 8 70 8 75 8 80ctt agg tta taa aat aaa att gaa aat aat ag t aga ttt ttt tgt ttt 2688Leu Arg Leu Asn Lys Ile Glu Asn Asn Ser Arg Phe Phe Cys Phe 885 890 895tgt caa aca aaa tag taa tac aat ttg ttt tt t tta gta caa aga aac 2736Cys Gln Thr Lys Tyr Asn Leu Phe Phe Leu Val Gln Arg Asn 900 905 910 taaata ggt cca aat tgt ttt ttt ttt aac at t cag cca aaa aag cca 2784 IleGly Pro Asn Cys Phe Phe Phe Asn Ile Gln Pro Lys Lys Pro 915 920 925 agattg atg cat ata tca aga aat cga aat ca a aac ttt tgt att caa 2832 ArgLeu Met His Ile Ser Arg Asn Arg Asn Gl n Asn Phe Cys Ile Gln 930 935 940gta ttc tag ttt cac tat ata tag agt cca gt t tct gaa att taa aaa 2880Val Phe Phe His Tyr Ile Ser Pro Val Ser Glu Ile Lys 945 9 50 9 55 9 60atc att tac cta tat att act tga tta aca ga g aaa att cga gct gag 2928Ile Ile Tyr Leu Tyr Ile Thr Leu Thr Glu Lys Ile Arg Ala Glu 965 970 975ata cat tcc aag tgt ggc tac tca tgc ttc ac a cca tca atc gtt tga 2976Ile His Ser Lys Cys Gly Tyr Ser Cys Phe Th r Pro Ser Ile Val 980 985 990ctt aaa cca gcc cgc tgc aga gga tga taa tg g agg aga caa caa atc 3024Leu Lys Pro Ala Arg Cys Arg Gly Trp Arg Arg Gln Gln Ile 995 1000 1005act ttt gtc gag aat gca aaa ccc act tcg tc a ttt cag tgc ctc atc 3072Thr Phe Val Glu Asn Ala Lys Pro Thr Ser Se r Phe Gln Cys Leu Ile 10101015 1020 tga tta taa ttc tta cga aga tca agg tta tg t tct tga tga ggatca 3120 Leu Phe Leu Arg Arg Ser Arg Leu Cys Ser Gly Ser 1025 1030 10351040 aga tta tgc tct tga aga aga tgt acc att at t tct tga tga aga tgt3168 Arg Leu Cys Ser Arg Arg Cys Thr Ile Ile Ser Arg Cys 1045 1050 1055acc att att acc aag tgt caa gct tcc aat tg t tga gaa gct acc acg 3216Thr Ile Ile Thr Lys Cys Gln Ala Ser Asn Cy s Glu Ala Thr Thr 1060 10651070 atc cat tac atg ggt ctt cac caa aag gca tg t gtg ttt ttt gtt tcg3264 Ile His Tyr Met Gly Leu His Gln Lys Ala Cy s Val Phe Phe Val Ser1075 1080 1085 tac tag ttt caa aat att aat cat ata cta ta t agt aat cactca tag 3312 Tyr Phe Gln Asn Ile Asn His Ile Leu Tyr Ser Asn His Ser1090 1095 1100 tgc ata tat aca ttt ctt taa cat tgc agt ag c cag ctg atggct gaa 3360 Cys Ile Tyr Thr Phe Leu His Cys Ser Ser Gln Leu Met Ala Glu1105 1110 1115 1120 agt gat tct gtg att ggt aag aga caa atc ta t tat ttgaat ggt gag 3408 Ser Asp Ser Val Ile Gly Lys Arg Gln Ile Ty r Tyr LeuAsn Gly Glu 1125 1130 1135 gca cta gaa ttg agc agt gaa gaa gat gag ga agat gaa gaa gaa gat 3456 Ala Leu Glu Leu Ser Ser Glu Glu Asp Glu Gl uAsp Glu Glu Glu Asp 1140 1145 1150 gag gaa gaa atc aag aaa gaa aaa tgcgaa tt t tct gaa gat gta gac 3504 Glu Glu Glu Ile Lys Lys Glu Lys CysGlu Ph e Ser Glu Asp Val Asp 1155 1160 1165 cga ttt ata tgg tta gtt tttgca tta cat at g ttc ttg att att aat 3552 Arg Phe Ile Trp Leu Val PheAla Leu His Me t Phe Leu Ile Ile Asn 1170 1175 1180 ttg tag tcc ata tttaat aaa ctg ctc aag aa a ttt tca gga cgg ttg 3600 Leu Ser Ile Phe AsnLys Leu Leu Lys Lys Phe Ser Gly Arg Leu 1185 1190 1195 1200 ggc agg actatg gtt tgg atg atc tgg tcg tg c ggc gtg ctc tcg cca 3648 Gly Arg ThrMet Val Trp Met Ile Trp Ser Cy s Gly Val Leu Ser Pro 1205 1210 1215 agtacc tcg aag tgg atg ttt cgg aca tat tg g taa caa tat tcg aat 3696 SerThr Ser Lys Trp Met Phe Arg Thr Tyr Tr p Gln Tyr Ser Asn 1220 1225 1230aaa aac ttc ata cgt cga tca ata act ttc ct g ctt att taa ttt ttg 3744Lys Asn Phe Ile Arg Arg Ser Ile Thr Phe Le u Leu Ile Phe Leu 1235 12401245 ttg ttt ttc gtc gtg aga aat gtt tta aat tt t caa atc taa tgt agg3792 Leu Phe Phe Val Val Arg Asn Val Leu Asn Ph e Gln Ile Cys Arg 12501255 1260 aaa gat aca atg aac tca agc tta aga atg at g gaa ctg ctg gtgagg 3840 Lys Asp Thr Met Asn Ser Ser Leu Arg Met Me t Glu Leu Leu ValArg 1265 1270 1275 1280 ctt ctg att tga cat cca aga caa taa cta ct g ctttcc agg att ttg 3888 Leu Leu Ile His Pro Arg Gln Leu Leu Leu Ser Arg IleLeu 1285 1290 1295 ctg ata gac gtc att gcc gtc gtt gca tgg ta a ctt tgaatc ttt ctt 3936 Leu Ile Asp Val Ile Ala Val Val Ala Trp Leu Ile Phe Leu1300 1305 1310 ttt taa ttt agc cac aaa aaa ggg aga tga tc a tac atg ttttta ttt 3984 Phe Phe Ser His Lys Lys Gly Arg Ser Tyr Met Phe Leu Phe1315 1320 1325 tat ttt atc att tgt ttt aca gat att cga tt g tca tat gcatga gaa 4032 Tyr Phe Ile Ile Cys Phe Thr Asp Ile Arg Le u Ser Tyr AlaGlu 1330 1335 1340 gta tga gcc cga gtc tag atc cgt aag cat ta a att cattta aat tat 4080 Val Ala Arg Val Ile Arg Lys His Ile His Leu Asn Tyr1345 1350 1355 1360 ttt gtt agt ttc aca acc ctt ata tat aag gt t aag tgatta act taa 4128 Phe Val Ser Phe Thr Thr Leu Ile Tyr Lys Va l Lys LeuThr 1365 1370 1375 tta gat tgc ttt ggc ttg tca gag cga aga ca a atc tagttt gtt tga 4176 Leu Asp Cys Phe Gly Leu Ser Glu Arg Arg Gl n Ile PheVal 1380 1385 1390 gga tga aga tag aca acc atg cag tga gca tt g tta cctcaa ggt ctc 4224 Gly Arg Thr Thr Met Gln Ala Leu Leu Pro Gln Gly Leu1395 1400 1405 tat ctc tct ccc tct ctc tct caa ttt ttt tg t cta ttc cttaat tac 4272 Tyr Leu Ser Pro Ser Leu Ser Gln Phe Phe Cy s Leu Phe LeuAsn Tyr 1410 1415 1420 gtt tat tag tta ctg gtt taa tat taa ata gg t gaggag tgt gac aga 4320 Val Tyr Leu Leu Val Tyr Ile Gly Glu Glu Cys Asp Arg1425 1430 1435 1440 agc tga tca tgt gat gga taa tga taa ctc ta t atc aaacaa gat tgt 4368 Ser Ser Cys Asp Gly Leu Tyr Ile Lys Gln Asp Cys 14451450 1455 ggt ctc aga tcc aaa caa cac tat gtg gac gc c tgt aga gaa ggatct 4416 Gly Leu Arg Ser Lys Gln His Tyr Val Asp Al a Cys Arg Glu GlySer 1460 1465 1470 tta ctt gaa agg aat tga gat att tgg gag aa a cag gtaaaa aaa taa 4464 Leu Leu Glu Arg Asn Asp Ile Trp Glu Lys Gln Val Lys Lys1475 1480 1485 aaa tag att taa tgc att aat ata tat act ta c act gta ttcctt gat 4512 Lys Ile Cys Ile Asn Ile Tyr Thr Tyr Thr Val Phe Leu Asp1490 1495 1500 tat gct ggt tcg cag ttg tga tgt tgc att aa a cat act tcgggg gct 4560 Tyr Ala Gly Ser Gln Leu Cys Cys Ile Lys His Thr Ser Gly Ala1505 1510 1515 1520 taa gac gtg cct aga gat tta caa tta cat gc g cga acaaga tca atg 4608 Asp Val Pro Arg Asp Leu Gln Leu His Ala Arg Thr Arg SerMet 1525 1530 1535 tac tat gtc att aga cct taa caa aac tac ac a aag acacaa tca ggt 4656 Tyr Tyr Val Ile Arg Pro Gln Asn Tyr Thr Lys Thr Gln SerGly 1540 1545 1550 aca cta acc tat gtc gta att att ctc atg ac a tgt atgtta aaa aca 4704 Thr Leu Thr Tyr Val Val Ile Ile Leu Met Th r Cys MetLeu Lys Thr 1555 1560 1565 cat gaa gtt tcc tat atg tgt tga tgg ttt ta tcac agg tta cca aaa 4752 His Glu Val Ser Tyr Met Cys Trp Phe Tyr His ArgLeu Pro Lys 1570 1575 1580 aag tat ctc gaa aaa gta gta ggt cgg tcc gc aaaa aat cga gac tcc 4800 Lys Tyr Leu Glu Lys Val Val Gly Arg Ser Al aLys Asn Arg Asp Ser 1585 1590 1595 1600 gaa aat atg ctc gtt atc cgc ctgctt taa ag a aaa caa cta gtg gag 4848 Glu Asn Met Leu Val Ile Arg LeuLeu Arg Lys Gln Leu Val Glu 1605 1610 1615 aag cta agt ttt ata agc actaca cac cat gc a ctt gca agt caa aat 4896 Lys Leu Ser Phe Ile Ser ThrThr His His Al a Leu Ala Ser Gln Asn 1620 1625 1630 gtg gac agc aat gccctt gtt taa ctc acg aa a att gct gcg aga aat 4944 Val Asp Ser Asn AlaLeu Val Leu Thr Lys Ile Ala Ala Arg Asn 1635 1640 1645 att gcg ggt atgtca ttc aat ttt tcc taa gc c gga aga tcc atg aga 4992 Ile Ala Gly MetSer Phe Asn Phe Ser Ala Gly Arg Ser Met Arg 1650 1655 1660 ttt aat ttgaac atg agt ttg tat ttt ttg tt c agg tgc tca aag gat 5040 Phe Asn LeuAsn Met Ser Leu Tyr Phe Leu Ph e Arg Cys Ser Lys Asp 1665 1670 1675 1680tgc aac aat cgc ttt gga gga tgt aat tgt gc a att ggc caa tgc aca 5088Cys Asn Asn Arg Phe Gly Gly Cys Asn Cys Al a Ile Gly Gln Cys Thr 16851690 1695 aat cga caa tgt cct tgt ttt gct gct aat cg t gaa tgc gat ccagat 5136 Asn Arg Gln Cys Pro Cys Phe Ala Ala Asn Ar g Glu Cys Asp ProAsp 1700 1705 1710 ctt tgt cgg agt tgt cct ctt agg taa cac tt t cac ttcaat atc tct 5184 Leu Cys Arg Ser Cys Pro Leu Arg His Phe His Phe Asn IleSer 1715 1720 1725 tta tac aaa ttc tat aat caa agt aat tca aa c caa aagtct tat aaa 5232 Leu Tyr Lys Phe Tyr Asn Gln Ser Asn Ser As n Gln LysSer Tyr Lys 1730 1735 1740 aaa aac ttt ata tat agc tgt gga gat ggc ac tctt ggt gag aca cca 5280 Lys Asn Phe Ile Tyr Ser Cys Gly Asp Gly Th rLeu Gly Glu Thr Pro 1745 1750 1755 1760 gtg caa atc caa tgc aag aac atgcaa ttc ct c ctt caa acc aat aaa 5328 Val Gln Ile Gln Cys Lys Asn MetGln Phe Le u Leu Gln Thr Asn Lys 1765 1770 1775 aag gta atc aac gtc aaatcc gta ccg aaa at t taa aac taa tta tac 5376 Lys Val Ile Asn Val LysSer Val Pro Lys Il e Asn Leu Tyr 1780 1785 1790 gaa aga cat tta act atcatt tcc cgt att tt a cta gat tct cat tgg 5424 Glu Arg His Leu Thr IleIle Ser Arg Ile Le u Leu Asp Ser His Trp 1795 1800 1805 aaa gtc tga tgttca tgg atg ggg tgc att ta c atg ggt aag caa tca 5472 Lys Val Cys SerTrp Met Gly Cys Ile Tyr Met Gly Lys Gln Ser 1810 1815 1820 tgt aaa tataag aat aag ttt aat agt tat tg g tgc att cat aac act 5520 Cys Lys TyrLys Asn Lys Phe Asn Ser Tyr Tr p Cys Ile His Asn Thr 1825 1830 1835 1840ttt ttt ttt tta ata atg ttt tat act tta ga c cat taa ata tat tgt 5568Phe Phe Phe Leu Ile Met Phe Tyr Thr Leu As p His Ile Tyr Cys 1845 18501855 gtg ata tgg ttt gac ccg tca gga ctc tct ta a aaa gaa tga gta tct5616 Val Ile Trp Phe Asp Pro Ser Gly Leu Ser Lys Glu Val Ser 1860 18651870 cgg aga ata tac tgg aga act gat cac tca tg a tga agc taa tga gcg5664 Arg Arg Ile Tyr Trp Arg Thr Asp His Ser Ser Ala 1875 1880 1885 tgggag aat aga aga tcg gat tgg ttc ttc ct a cct ctt tac ctt gaa 5712 TrpGlu Asn Arg Arg Ser Asp Trp Phe Phe Le u Pro Leu Tyr Leu Glu 1890 18951900 tga tca ggt aac ttc aga ata att ttg aag ta a cgt ttt aat cat tcg5760 Ser Gly Asn Phe Arg Ile Ile Leu Lys Arg Phe Asn His Ser 1905 19101915 1920 cgg gtt aca cat cta ttc gaa tca aag taa ca t tta ttt tac agctcg 5808 Arg Val Thr His Leu Phe Glu Ser Lys His Leu Phe Tyr Ser Ser1925 1930 1935 aaa tcg atg ctc gcc gta aag gaa acg agt tc a aat ttc tcaatc act 5856 Lys Ser Met Leu Ala Val Lys Glu Thr Ser Se r Asn Phe SerIle Thr 1940 1945 1950 cag caa gac cta act gct acg cca agg tac ta a gccgtt ata ctt tat 5904 Gln Gln Asp Leu Thr Ala Thr Pro Arg Tyr Ala Val IleLeu Tyr 1955 1960 1965 ctt gaa caa ata cta aca tta tac aaa caa aa a tactta tgt tag ttt 5952 Leu Glu Gln Ile Leu Thr Leu Tyr Lys Gln Ly s TyrLeu Cys Phe 1970 1975 1980 ctt tag tta aat cgt gta tca act tta ctc gt cgtt gat tgg ttt tca 6000 Leu Leu Asn Arg Val Ser Thr Leu Leu Val Val AspTrp Phe Ser 1985 1990 1995 2000 tat tga aga tat tcc aag aaa ctc aaa ctcat t tta aat gat ttt ttc 6048 Tyr Arg Tyr Ser Lys Lys Leu Lys Leu IleLeu Asn Asp Phe Phe 2005 2010 2015 ttg tcg aga aaa ttt agg tta cga aaattt at g gtt tcg tgt gca gtt 6096 Leu Ser Arg Lys Phe Arg Leu Arg LysPhe Me t Val Ser Cys Ala Val 2020 2025 2030 gat gat tgt gag agg aga tcagag gat tgg tc t att tgc gga gag agc 6144 Asp Asp Cys Glu Arg Arg SerGlu Asp Trp Se r Ile Cys Gly Glu Ser 2035 2040 2045 aat cga aga agg tgagga gct ttt ctt cga ct a ctg cta tgg acc aga 6192 Asn Arg Arg Arg GlyAla Phe Leu Arg Leu Leu Leu Trp Thr Arg 2050 2055 2060 aca tgc gga ttggtc gcg tgg tcg aga acc ta g aaa gac tgg tgc ttc 6240 Thr Cys Gly LeuVal Ala Trp Ser Arg Thr Lys Asp Trp Cys Phe 2065 2070 2075 2080 taa aaggtc taa gga agc ccg tcc agc tcg tt a gtt ttt gat ctg agg 6288 Lys ValGly Ser Pro Ser Ser Ser Leu Val Phe Asp Leu Arg 2085 2090 2095 aga agcagc aat tca agc agt cct ttt ttt at g tta tgg tat atc aat 6336 Arg SerSer Asn Ser Ser Ser Pro Phe Phe Me t Leu Trp Tyr Ile Asn 2100 2105 2110taa taa tgt aat gct att ttg tgt tac taa ac c aaa act taa gtt tct 6384Cys Asn Ala Ile Leu Cys Tyr Thr Lys Thr Val Ser 2115 2120 2125 gtt ttattt gtt tta ggg tgt ttt gtt tgt at c ata tgt gtc tta act 6432 Val LeuPhe Val Leu Gly Cys Phe Val Cys Il e Ile Cys Val Leu Thr 2130 2135 2140ttc aaa gtt ttc ttt ttg tat ttc aat tta aa a aca atg ttt atg ttg 6480Phe Lys Val Phe Phe Leu Tyr Phe Asn Leu Ly s Thr Met Phe Met Leu 21452150 2155 2160 tta gtt tgc ata gac ctt tgg aaa aaa aaa gc t ttg cac aacttt aca 6528 Leu Val Cys Ile Asp Leu Trp Lys Lys Lys Al a Leu His AsnPhe Thr 2165 2170 2175 ttt att tag tct tca ttt agc gaa aaa tca ca t aacaca agt ctg tgg 6576 Phe Ile Ser Ser Phe Ser Glu Lys Ser His Asn Thr SerLeu Trp 2180 2185 2190 tac gta atg tac aaa aat gtc aaa ata atg gg t tttatc att aaa aaa 6624 Tyr Val Met Tyr Lys Asn Val Lys Ile Met Gl y PheIle Ile Lys Lys 2195 2200 2205 aaa tat tgg tta tga atg aag tat agt tagaa t ttt agg tat tag ctc 6672 Lys Tyr Trp Leu Met Lys Tyr Ser Asn PheArg Tyr Leu 2210 2215 2220 gtt tgg ttt taa aac gtt ttt cga gat tta at tttg tag tct att gag 6720 Val Trp Phe Asn Val Phe Arg Asp Leu Ile Leu SerIle Glu 2225 2230 2235 2240 taa tac atg gaa gaa tca tca aca aag tgg ct gtag ctt acg aaa ggt 6768 Tyr Met Glu Glu Ser Ser Thr Lys Trp Leu Leu ThrLys Gly 2245 2250 2255 ttt act tta atg taa ata tgt att tga tgc at c taacat tta gta tct 6816 Phe Thr Leu Met Ile Cys Ile Cys Ile His Leu Val Ser2260 2265 2270 aaa caa ata aaa aca aaa aaa aag aaa aaa gc t ctt taa aatccg aaa 6864 Lys Gln Ile Lys Thr Lys Lys Lys Lys Lys Al a Leu Asn ProLys 2275 2280 2285 gta act att ttc aaa aaa tct aaa tta taa ac t taa atgttt gga atc 6912 Val Thr Ile Phe Lys Lys Ser Lys Leu Thr Met Phe Gly Ile2290 2295 2300 gcg aac gac tat tgc taa ata taa atg cta aa t ata cat gaagat gtg 6960 Ala Asn Asp Tyr Cys Ile Met Leu Asn Ile His Glu Asp Val2305 2310 2315 2320 aaa aac atg ttg gat ttg tgg aat cgt taa tg a cca cggtta aat ggc 7008 Lys Asn Met Leu Asp Leu Trp Asn Arg Pro Arg Leu Asn Gly2325 2330 2335 ggg atc c 7015 Gly Ile <210> SEQ ID NO: 7 <211> LENGTH:34 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 7 GlySer Ile Ile Phe Lys Asn Gln Ile Phe Se r Tyr Leu Leu Phe Val 1 5 10 15Ser Lys Lys Lys Lys Thr His Asp Asp Tyr Pr o Ser Ala Gly Cys Val 20 2530 His Arg <210> SEQ ID NO: 8 <211> LENGTH: 12 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 8 Thr Tyr Ile Leu Lys Leu ValGly Phe Ser Le u Pro 1 5 10 <210> SEQ ID NO: 9 <211> LENGTH: 18 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 9 Val Trp ThrCys Phe Tyr Asn Leu Met Tyr Se r Val Asp Gln Lys Ile 1 5 10 15 Glu Lys<210> SEQ ID NO: 10 <211> LENGTH: 6 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 10 Glu Arg Glu Pro Leu Trp 1 5 <210> SEQID NO: 11 <211> LENGTH: 20 <212> TYPE: PRT <213> ORGANISM: Arabidopsissp. <400> SEQUENCE: 11 Gln Asn Arg Asn His Tyr Ile Glu Ser Phe Gl u LysThr Lys Arg Ser 1 5 10 15 Asn Leu Cys Ser 20 <210> SEQ ID NO: 12 <211>LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 12 Met Thr Ile Asp Val Ala Ala Asn Tyr Ser Le u Asn Ala PheIle 1 5 10 15 <210> SEQ ID NO: 13 <211> LENGTH: 307 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 13 Ile Phe Leu Thr SerSer Val Pro Ser His Se r Arg Asn Ser Ile Ile 1 5 10 15 Pro Phe Ser PhePhe Phe Ser Val Phe Gln Se r Leu Arg Ile Lys Met 20 25 30 Glu His GluGlu Thr Gln Lys Asn Thr Arg As n Ser Trp Ser Leu Ile 35 40 45 Arg ProPhe Gln Met Ile Ser Ile Ser Phe Le u Ser Leu Leu Leu Pro 50 55 60 LeuSer Phe Leu Phe Leu Ser Arg Leu Ser Le u Tyr Thr Ser Ser Thr 65 70 75 80Pro Val Thr Val Ser Gly Val Ser Ser Val Il e His Gln Ala Asp Val 85 9095 Gly Val Leu Tyr Thr Ile Leu Phe Leu Ile Il e Val Phe Thr Leu Ile 100105 110 His Ser Leu Ser Gly Lys Pro Glu Cys Ser Va l Leu His Ser His Leu115 120 125 Tyr Ile Cys Trp Ile Val Leu Phe Ile Ala Gl n Ala Cys Ala PheGly 130 135 140 Ile Lys Arg Thr Met Ser Thr Thr Met Ser Il e Asn Pro AspLys Asn 145 1 50 1 55 1 60 Leu Phe Leu Ala Thr His Glu Arg Trp Met Le uVal Arg Val Leu Phe 165 170 175 Phe Leu Gly Leu His Glu Val Met Leu MetTr p Phe Arg Val Val Val 180 185 190 Lys Pro Val Val Asp Asn Thr Ile TyrGly Va l Tyr Val Glu Glu Arg 195 200 205 Trp Ser Glu Arg Ala Val Val AlaVal Thr Ph e Gly Ile Met Trp Trp 210 215 220 Trp Arg Leu Arg Asp Glu ValGlu Ser Leu Va l Val Val Val Thr Ala 225 2 30 2 35 2 40 Asp Arg Leu AsnLeu Pro Ile Arg Leu Glu Gl y Leu Asn Phe Val Asn 245 250 255 Trp Cys MetTyr Tyr Ile Cys Val Gly Ile Gl y Leu Met Lys Ile Phe 260 265 270 Lys GlyPhe Leu Asp Phe Val Asn Thr Leu Th r Leu Ser Ile Lys Arg 275 280 285 SerArg Lys Gly Cys Glu Ser Cys Val Phe As p Asp Met Cys Asn Asp 290 295 300Asp His Val 305 <210> SEQ ID NO: 14 <211> LENGTH: 6 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 14 His Ile Ile Leu IleSer 1 5 <210> SEQ ID NO: 15 <211> LENGTH: 42 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 15 Met Phe Leu Arg Phe Phe TyrPhe Tyr Phe Le u Phe Leu Ala Arg Asn 1 5 10 15 Leu Thr Arg Ile Tyr ValThr Lys Ile Val Gl u Tyr Gln Lys Ala Lys 20 25 30 Ile Phe Tyr Leu LysIle Thr Ile Glu His 35 40 <210> SEQ ID NO: 16 <211> LENGTH: 14 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 16 Phe Lys SerPhe Tyr Asn Tyr Ile Phe Ile Th r His Pro Phe 1 5 10 <210> SEQ ID NO: 17<211> LENGTH: 26 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 17 Glu Lys Leu Gly Asp Leu Ile Asn Val Ile As n Ser Lys LysTyr Arg 1 5 10 15 Ile Tyr Val Glu Val Leu Asn Ala Tyr Asn 20 25 <210>SEQ ID NO: 18 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 18 Ile Tyr Glu Leu Asn Asn Ile Ala IleTyr Il e Phe Leu Lys Ile 1 5 10 15 <210> SEQ ID NO: 19 <211> LENGTH: 9<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 19 ThrHis Phe Val Ser Ser Ile Tyr Ala 1 5 <210> SEQ ID NO: 20 <211> LENGTH: 13<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 20 TyrIle Ser Leu Asn Arg Lys Leu Ala Arg As n Glu Tyr 1 5 10 <210> SEQ ID NO:21 <211> LENGTH: 31 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 21 Tyr Lys Ser Tyr Arg Thr Leu Gln Asn Val Ty r Ile AspLeu Ser Thr 1 5 10 15 Phe Phe His Trp Phe Thr Lys Pro Ser Cys Hi s IleAsn Met Ser 20 25 30 <210> SEQ ID NO: 22 <211> LENGTH: 31 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 22 Arg Leu Phe PheTyr Asn Ile Val Tyr Glu Ph e Lys Leu Glu Leu Ser 1 5 10 15 Asn Val LysGln Thr Gln His Leu His Thr Ty r Ser Thr Ile Phe 20 25 30 <210> SEQ IDNO: 23 <211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 23 Lys Leu Lys Phe Ser 1 5 <210> SEQ ID NO: 24 <211>LENGTH: 23 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 24 Ile Ser His Ile Ile Phe Leu Leu Lys Gln Al a Ser Pro AsnThr Phe 1 5 10 15 Leu Pro Asp Tyr Asn Phe Pro 20 <210> SEQ ID NO: 25<211> LENGTH: 34 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 25 Gly Phe Leu Gln Lys Lys Ile Asn Phe Leu Ph e Lys Lys ProPhe Ala 1 5 10 15 Leu Ser Phe Ser Pro Thr Ser Glu Lys Thr Ar g Lys LysGlu Glu Ala 20 25 30 Ser Gly <210> SEQ ID NO: 26 <211> LENGTH: 12 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 26 Trp Arg ArgLeu Val Ser Leu Gln Thr Tyr Me t Asn 1 5 10 <210> SEQ ID NO: 27 <211>LENGTH: 8 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 27 Leu Gly Tyr Glu Ile His Ile Phe 1 5 <210> SEQ ID NO: 28<211> LENGTH: 25 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 28 Leu Cys Val Tyr Asp Arg Ser Ile Thr Phe Ar g Val Glu PheSer Cys 1 5 10 15 Asp Leu Leu Cys Tyr Ser Ser His Ala 20 25 <210> SEQ IDNO: 29 <211> LENGTH: 24 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 29 Ser Ile Lys Leu Leu Phe Leu Asn Leu Ser Ar g Lys ThrMet Arg Thr 1 5 10 15 Met Val Arg Val Cys His Pro Asn 20 <210> SEQ IDNO: 30 <211> LENGTH: 13 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 30 Lys Ser Lys Ser Lys Arg Arg Asp Phe Cys Il e Ser Arg1 5 10 <210> SEQ ID NO: 31 <211> LENGTH: 18 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 31 Glu Thr Phe Gly Cys Phe AsnIle Leu Phe Se r Ser Val Cys Phe Ser 1 5 10 15 Glu Asn <210> SEQ ID NO:32 <211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 32 Gly Glu Glu Arg Thr 1 5 <210> SEQ ID NO: 33 <211> LENGTH:10 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 33Ser His Asn Tyr Thr Ile Pro Lys Arg Cys 1 5 10 <210> SEQ ID NO: 34 <211>LENGTH: 13 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 34 Asp Thr Ser Asn Lys Gln Leu Tyr Ile Ser Hi s Asn Leu 1 5 10<210> SEQ ID NO: 35 <211> LENGTH: 7 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 35 Lys Glu Lys Phe Pro Asn Phe 1 5 <210>SEQ ID NO: 36 <211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 36 Ile Lys Asn Arg Ile 1 5 <210> SEQ IDNO: 37 <211> LENGTH: 6 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 37 Lys Met Pro Ala Asn Arg 1 5 <210> SEQ ID NO: 38 <211>LENGTH: 12 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 38 Arg His Pro Asp Leu Ser Gly Ile Gln Asn Le u Glu 1 5 10<210> SEQ ID NO: 39 <211> LENGTH: 11 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 39 Tyr Ile Tyr Asn Ile Lys Leu Glu LeuArg Le u 1 5 10 <210> SEQ ID NO: 40 <211> LENGTH: 16 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 40 Asn Lys Ile Glu AsnAsn Ser Arg Phe Phe Cy s Phe Cys Gln Thr Lys 1 5 10 15 <210> SEQ ID NO:41 <211> LENGTH: 10 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 41 Tyr Asn Leu Phe Phe Leu Val Gln Arg Asn 1 5 10 <210>SEQ ID NO: 42 <211> LENGTH: 33 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 42 Ile Gly Pro Asn Cys Phe Phe Phe AsnIle Gl n Pro Lys Lys Pro Arg 1 5 10 15 Leu Met His Ile Ser Arg Asn ArgAsn Gln As n Phe Cys Ile Gln Val 20 25 30 Phe <210> SEQ ID NO: 43 <211>LENGTH: 4 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 43 Phe His Tyr Ile 1 <210> SEQ ID NO: 44 <211> LENGTH: 6 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 44 Ser Pro ValSer Glu Ile 1 5 <210> SEQ ID NO: 45 <211> LENGTH: 8 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 45 Lys Ile Ile Tyr LeuTyr Ile Thr 1 5 <210> SEQ ID NO: 46 <211> LENGTH: 23 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 46 Leu Thr Glu Lys IleArg Ala Glu Ile His Se r Lys Cys Gly Tyr Ser 1 5 10 15 Cys Phe Thr ProSer Ile Val 20 <210> SEQ ID NO: 47 <211> LENGTH: 8 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 47 Leu Lys Pro Ala Arg Cys ArgGly 1 5 <210> SEQ ID NO: 48 <211> LENGTH: 22 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 48 Trp Arg Arg Gln Gln Ile ThrPhe Val Glu As n Ala Lys Pro Thr Ser 1 5 10 15 Ser Phe Gln Cys Leu Ile20 <210> SEQ ID NO: 49 <211> LENGTH: 9 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 49 Phe Leu Arg Arg Ser Arg Leu Cys Ser 15 <210> SEQ ID NO: 50 <211> LENGTH: 6 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 50 Gly Ser Arg Leu Cys Ser 1 5 <210> SEQID NO: 51 <211> LENGTH: 7 <212> TYPE: PRT <213> ORGANISM: Arabidopsissp. <400> SEQUENCE: 51 Arg Arg Cys Thr Ile Ile Ser 1 5 <210> SEQ ID NO:52 <211> LENGTH: 13 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 52 Arg Cys Thr Ile Ile Thr Lys Cys Gln Ala Se r Asn Cys1 5 10 <210> SEQ ID NO: 53 <211> LENGTH: 21 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 53 Glu Ala Thr Thr Ile His TyrMet Gly Leu Hi s Gln Lys Ala Cys Val 1 5 10 15 Phe Phe Val Ser Tyr 20<210> SEQ ID NO: 54 <211> LENGTH: 13 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 54 Phe Gln Asn Ile Asn His Ile Leu TyrSer As n His Ser 1 5 10 <210> SEQ ID NO: 55 <211> LENGTH: 6 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 55 Cys Ile Tyr ThrPhe Leu 1 5 <210> SEQ ID NO: 56 <211> LENGTH: 74 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 56 His Cys Ser Ser Gln Leu MetAla Glu Ser As p Ser Val Ile Gly Lys 1 5 10 15 Arg Gln Ile Tyr Tyr LeuAsn Gly Glu Ala Le u Glu Leu Ser Ser Glu 20 25 30 Glu Asp Glu Glu AspGlu Glu Glu Asp Glu Gl u Glu Ile Lys Lys Glu 35 40 45 Lys Cys Glu PheSer Glu Asp Val Asp Arg Ph e Ile Trp Leu Val Phe 50 55 60 Ala Leu HisMet Phe Leu Ile Ile Asn Leu 65 70 <210> SEQ ID NO: 57 <211> LENGTH: 41<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 57 SerIle Phe Asn Lys Leu Leu Lys Lys Phe Se r Gly Arg Leu Gly Arg 1 5 10 15Thr Met Val Trp Met Ile Trp Ser Cys Gly Va l Leu Ser Pro Ser Thr 20 2530 Ser Lys Trp Met Phe Arg Thr Tyr Trp 35 40 <210> SEQ ID NO: 58 <211>LENGTH: 17 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 58 Gln Tyr Ser Asn Lys Asn Phe Ile Arg Arg Se r Ile Thr PheLeu Leu 1 5 10 15 Ile <210> SEQ ID NO: 59 <211> LENGTH: 15 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 59 Phe Leu Leu PhePhe Val Val Arg Asn Val Le u Asn Phe Gln Ile 1 5 10 15 <210> SEQ ID NO:60 <211> LENGTH: 21 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 60 Cys Arg Lys Asp Thr Met Asn Ser Ser Leu Ar g Met MetGlu Leu Leu 1 5 10 15 Val Arg Leu Leu Ile 20 <210> SEQ ID NO: 61 <211>LENGTH: 4 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 61 His Pro Arg Gln 1 <210> SEQ ID NO: 62 <211> LENGTH: 17<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 62 LeuLeu Leu Ser Arg Ile Leu Leu Ile Asp Va l Ile Ala Val Val Ala 1 5 10 15Trp <210> SEQ ID NO: 63 <211> LENGTH: 4 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 63 Ile Phe Leu Phe 1 <210> SEQ ID NO: 64<211> LENGTH: 7 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 64 Phe Ser His Lys Lys Gly Arg 1 5 <210> SEQ ID NO: 65 <211>LENGTH: 20 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 65 Ser Tyr Met Phe Leu Phe Tyr Phe Ile Ile Cy s Phe Thr AspIle Arg 1 5 10 15 Leu Ser Tyr Ala 20 <210> SEQ ID NO: 66 <211> LENGTH: 4<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 66 IleArg Lys His 1 <210> SEQ ID NO: 67 <211> LENGTH: 17 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 67 Ile His Leu Asn Tyr Phe ValSer Phe Thr Th r Leu Ile Tyr Lys Val 1 5 10 15 Lys <210> SEQ ID NO: 68<211> LENGTH: 12 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 68 Leu Asp Cys Phe Gly Leu Ser Glu Arg Arg Gl n Ile 1 5 10<210> SEQ ID NO: 69 <211> LENGTH: 4 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 69 Thr Thr Met Gln 1 <210> SEQ ID NO: 70<211> LENGTH: 25 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 70 Ala Leu Leu Pro Gln Gly Leu Tyr Leu Ser Pr o Ser Leu SerGln Phe 1 5 10 15 Phe Cys Leu Phe Leu Asn Tyr Val Tyr 20 25 <210> SEQ IDNO: 71 <211> LENGTH: 8 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 71 Ile Gly Glu Glu Cys Asp Arg Ser 1 5 <210> SEQ ID NO:72 <211> LENGTH: 4 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 72 Ser Cys Asp Gly 1 <210> SEQ ID NO: 73 <211> LENGTH: 28<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 73 LeuTyr Ile Lys Gln Asp Cys Gly Leu Arg Se r Lys Gln His Tyr Val 1 5 10 15Asp Ala Cys Arg Glu Gly Ser Leu Leu Glu Ar g Asn 20 25 <210> SEQ ID NO:74 <211> LENGTH: 9 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 74 Asp Ile Trp Glu Lys Gln Val Lys Lys 1 5 <210> SEQ ID NO: 75<211> LENGTH: 18 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 75 Cys Ile Asn Ile Tyr Thr Tyr Thr Val Phe Le u Asp Tyr AlaGly Ser 1 5 10 15 Gln Leu <210> SEQ ID NO: 76 <211> LENGTH: 9 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 76 Cys Cys IleLys His Thr Ser Gly Ala 1 5 <210> SEQ ID NO: 77 <211> LENGTH: 21 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 77 Asp Val ProArg Asp Leu Gln Leu His Ala Ar g Thr Arg Ser Met Tyr 1 5 10 15 Tyr ValIle Arg Pro 20 <210> SEQ ID NO: 78 <211> LENGTH: 32 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 78 Gln Asn Tyr Thr LysThr Gln Ser Gly Thr Le u Thr Tyr Val Val Ile 1 5 10 15 Ile Leu Met ThrCys Met Leu Lys Thr His Gl u Val Ser Tyr Met Cys 20 25 30 <210> SEQ IDNO: 79 <211> LENGTH: 33 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 79 Trp Phe Tyr His Arg Leu Pro Lys Lys Tyr Le u Glu LysVal Val Gly 1 5 10 15 Arg Ser Ala Lys Asn Arg Asp Ser Glu Asn Me t LeuVal Ile Arg Leu 20 25 30 Leu <210> SEQ ID NO: 80 <211> LENGTH: 29 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 80 Arg Lys GlnLeu Val Glu Lys Leu Ser Phe Il e Ser Thr Thr His His 1 5 10 15 Ala LeuAla Ser Gln Asn Val Asp Ser Asn Al a Leu Val 20 25 <210> SEQ ID NO: 81<211> LENGTH: 17 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 81 Leu Thr Lys Ile Ala Ala Arg Asn Ile Ala Gl y Met Ser PheAsn Phe 1 5 10 15 Ser <210> SEQ ID NO: 82 <211> LENGTH: 62 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 82 Ala Gly Arg SerMet Arg Phe Asn Leu Asn Me t Ser Leu Tyr Phe Leu 1 5 10 15 Phe Arg CysSer Lys Asp Cys Asn Asn Arg Ph e Gly Gly Cys Asn Cys 20 25 30 Ala IleGly Gln Cys Thr Asn Arg Gln Cys Pr o Cys Phe Ala Ala Asn 35 40 45 ArgGlu Cys Asp Pro Asp Leu Cys Arg Ser Cy s Pro Leu Arg 50 55 60 <210> SEQID NO: 83 <211> LENGTH: 66 <212> TYPE: PRT <213> ORGANISM: Arabidopsissp. <400> SEQUENCE: 83 His Phe His Phe Asn Ile Ser Leu Tyr Lys Ph e TyrAsn Gln Ser Asn 1 5 10 15 Ser Asn Gln Lys Ser Tyr Lys Lys Asn Phe Il eTyr Ser Cys Gly Asp 20 25 30 Gly Thr Leu Gly Glu Thr Pro Val Gln Ile Gln Cys Lys Asn Met Gln 35 40 45 Phe Leu Leu Gln Thr Asn Lys Lys Val IleAs n Val Lys Ser Val Pro 50 55 60 Lys Ile 65 <210> SEQ ID NO: 84 <211>LENGTH: 20 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 84 Leu Tyr Glu Arg His Leu Thr Ile Ile Ser Ar g Ile Leu LeuAsp Ser 1 5 10 15 His Trp Lys Val 20 <210> SEQ ID NO: 85 <211> LENGTH:41 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 85Cys Ser Trp Met Gly Cys Ile Tyr Met Gly Ly s Gln Ser Cys Lys Tyr 1 5 1015 Lys Asn Lys Phe Asn Ser Tyr Trp Cys Ile Hi s Asn Thr Phe Phe Phe 2025 30 Leu Ile Met Phe Tyr Thr Leu Asp His 35 40 <210> SEQ ID NO: 86<211> LENGTH: 13 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 86 Ile Tyr Cys Val Ile Trp Phe Asp Pro Ser Gl y Leu Ser 1 5 10<210> SEQ ID NO: 87 <211> LENGTH: 12 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 87 Val Ser Arg Arg Ile Tyr Trp Arg ThrAsp Hi s Ser 1 5 10 <210> SEQ ID NO: 88 <211> LENGTH: 17 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 88 Ala Trp Glu Asn ArgArg Ser Asp Trp Phe Ph e Leu Pro Leu Tyr Leu 1 5 10 15 Glu <210> SEQ IDNO: 89 <211> LENGTH: 9 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 89 Ser Gly Asn Phe Arg Ile Ile Leu Lys 1 5 <210> SEQ IDNO: 90 <211> LENGTH: 14 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 90 Arg Phe Asn His Ser Arg Val Thr His Leu Ph e Glu SerLys 1 5 10 <210> SEQ ID NO: 91 <211> LENGTH: 32 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 91 His Leu Phe Tyr Ser Ser LysSer Met Leu Al a Val Lys Glu Thr Ser 1 5 10 15 Ser Asn Phe Ser Ile ThrGln Gln Asp Leu Th r Ala Thr Pro Arg Tyr 20 25 30 <210> SEQ ID NO: 92<211> LENGTH: 19 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 92 Ala Val Ile Leu Tyr Leu Glu Gln Ile Leu Th r Leu Tyr LysGln Lys 1 5 10 15 Tyr Leu Cys <210> SEQ ID NO: 93 <211> LENGTH: 15 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 93 Leu Asn ArgVal Ser Thr Leu Leu Val Val As p Trp Phe Ser Tyr 1 5 10 15 <210> SEQ IDNO: 94 <211> LENGTH: 50 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 94 Arg Tyr Ser Lys Lys Leu Lys Leu Ile Leu As n Asp PhePhe Leu Ser 1 5 10 15 Arg Lys Phe Arg Leu Arg Lys Phe Met Val Se r CysAla Val Asp Asp 20 25 30 Cys Glu Arg Arg Ser Glu Asp Trp Ser Ile Cy sGly Glu Ser Asn Arg 35 40 45 Arg Arg 50 <210> SEQ ID NO: 95 <211>LENGTH: 21 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 95 Gly Ala Phe Leu Arg Leu Leu Leu Trp Thr Ar g Thr Cys GlyLeu Val 1 5 10 15 Ala Trp Ser Arg Thr 20 <210> SEQ ID NO: 96 <211>LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 96 Lys Asp Trp Cys Phe 1 5 <210> SEQ ID NO: 97 <211> LENGTH:28 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 97Gly Ser Pro Ser Ser Ser Leu Val Phe Asp Le u Arg Arg Ser Ser Asn 1 5 1015 Ser Ser Ser Pro Phe Phe Met Leu Trp Tyr Il e Asn 20 25 <210> SEQ IDNO: 98 <211> LENGTH: 7 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 98 Cys Asn Ala Ile Leu Cys Tyr 1 5 <210> SEQ ID NO: 99<211> LENGTH: 52 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 99 Val Ser Val Leu Phe Val Leu Gly Cys Phe Va l Cys Ile IleCys Val 1 5 10 15 Leu Thr Phe Lys Val Phe Phe Leu Tyr Phe As n Leu LysThr Met Phe 20 25 30 Met Leu Leu Val Cys Ile Asp Leu Trp Lys Ly s LysAla Leu His Asn 35 40 45 Phe Thr Phe Ile 50 <210> SEQ ID NO: 100 <211>LENGTH: 33 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 100 Ser Ser Phe Ser Glu Lys Ser His Asn Thr Se r Leu Trp TyrVal Met 1 5 10 15 Tyr Lys Asn Val Lys Ile Met Gly Phe Ile Il e Lys LysLys Tyr Trp 20 25 30 Leu <210> SEQ ID NO: 101 <211> LENGTH: 4 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 101 Met LysTyr Ser 1 <210> SEQ ID NO: 102 <211> LENGTH: 4 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 102 Asn Phe Arg Tyr 1 <210>SEQ ID NO: 103 <211> LENGTH: 4 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 103 Leu Val Trp Phe 1 <210> SEQ ID NO:104 <211> LENGTH: 8 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 104 Asn Val Phe Arg Asp Leu Ile Leu 1 5 <210> SEQ ID NO:105 <211> LENGTH: 10 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 105 Tyr Met Glu Glu Ser Ser Thr Lys Trp Leu 1 5 10 <210>SEQ ID NO: 106 <211> LENGTH: 8 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 106 Leu Thr Lys Gly Phe Thr Leu Met 1 5<210> SEQ ID NO: 107 <211> LENGTH: 16 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 107 His Leu Val Ser Lys Gln Ile Lys ThrLys Ly s Lys Lys Lys Ala Leu 1 5 10 15 <210> SEQ ID NO: 108 <211>LENGTH: 12 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 108 Asn Pro Lys Val Thr Ile Phe Lys Lys Ser Ly s Leu 1 5 10<210> SEQ ID NO: 109 <211> LENGTH: 9 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 109 Met Phe Gly Ile Ala Asn Asp Tyr Cys1 5 <210> SEQ ID NO: 110 <211> LENGTH: 17 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 110 Met Leu Asn Ile His GluAsp Val Lys Asn Me t Leu Asp Leu Trp Asn 1 5 10 15 Arg <210> SEQ ID NO:111 <211> LENGTH: 7 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 111 Pro Arg Leu Asn Gly Gly Ile 1 5 <210> SEQ ID NO: 112<211> LENGTH: 38 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 112 Asp Pro Leu Phe Leu Lys Ile Lys Phe Phe Hi s Ile Tyr TyrLeu Phe 1 5 10 15 Gln Arg Lys Lys Lys His Thr Thr Ile Ile Hi s Leu ProAla Val Phe 20 25 30 Ile Gly Lys Pro Ile Phe 35 <210> SEQ ID NO: 113<211> LENGTH: 16 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 113 Asn Trp Trp Ala Phe His Tyr His Lys Phe Gl y His Val PheIle Ile 1 5 10 15 <210> SEQ ID NO: 114 <211> LENGTH: 33 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 114 Arg Asn Lys Lys GlyAsn Leu Cys Gly Asp Cy s Asn Lys Thr Glu Ile 1 5 10 15 Ile Ile Leu AsnHis Ser Lys Arg Arg Lys As p Gln Thr Phe Val Ala 20 25 30 Arg <210> SEQID NO: 115 <211> LENGTH: 59 <212> TYPE: PRT <213> ORGANISM: Arabidopsissp. <400> SEQUENCE: 115 Thr Trp Leu Pro Ile Thr Val Leu Met Leu Le u TyrArg Ser Phe Leu 1 5 10 15 His Pro Leu Phe Leu His Ile Gln Glu Thr Va lSer Ser His Phe Leu 20 25 30 Ser Ser Ser Gln Cys Phe Asn Leu Cys Glu Leu Arg Trp Asn Met Lys 35 40 45 Lys His Lys Arg Thr Gln Glu Thr Ala GlyPr o 50 55 <210> SEQ ID NO: 116 <211> LENGTH: 5 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 116 Phe Asp His Phe Lys 1 5<210> SEQ ID NO: 117 <211> LENGTH: 57 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 117 Ser Pro Leu Ala Phe Leu Ala Ser SerSer Le u Tyr Leu Ser Ser Phe 1 5 10 15 Phe His Val Ser Leu Ser Ile ProPro Gln Le u Arg Ser Pro Ser Pro 20 25 30 Ala Phe Pro Leu Leu Phe ThrArg Gln Met Se r Glu Ser Tyr Thr Arg 35 40 45 Ser Cys Phe Ser Ser SerSer Ser Leu 50 55 <210> SEQ ID NO: 118 <211> LENGTH: 37 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 118 Ser Thr Val Ser GlnGlu Asn Gln Asn Ala Le u Phe Ser Ile Pro Ile 1 5 10 15 Ser Thr Ser AlaGly Ser Phe Ser Ser Ser Pr o Lys Leu Val Pro Leu 20 25 30 Gly Ser LysGlu Pro 35 <210> SEQ ID NO: 119 <211> LENGTH: 5 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 119 Ala Arg Pro Cys Leu 1 5<210> SEQ ID NO: 120 <211> LENGTH: 27 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 120 Ile Gln Thr Lys Thr Cys Phe Leu ArgHis Me t Lys Asp Gly Cys Trp 1 5 10 15 Leu Gly Phe Cys Ser Phe Trp GlyTyr Thr Ly s 20 25 <210> SEQ ID NO: 121 <211> LENGTH: 31 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 121 Cys Gly Leu Glu SerTrp Leu Ser Leu Trp Le u Thr Thr Leu Tyr Met 1 5 10 15 Gly Ser Thr TrpArg Arg Gly Gly Pro Arg Gl u Pro Leu Trp Gln 20 25 30 <210> SEQ ID NO:122 <211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 122 Cys Gly Gly Gly Gly 1 5 <210> SEQ ID NO: 123 <211>LENGTH: 23 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 123 Lys Val Leu Trp Trp Trp Leu Arg Arg Ile As p Leu Thr SerPro Phe 1 5 10 15 Val Trp Arg Val Ser Ile Leu 20 <210> SEQ ID NO: 124<211> LENGTH: 12 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 124 Thr Gly Val Cys Ile Thr Ser Val Leu Glu Le u Val 1 5 10<210> SEQ ID NO: 125 <211> LENGTH: 9 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 125 Arg Ser Ser Lys Gly Phe Trp Ile Leu1 5 <210> SEQ ID NO: 126 <211> LENGTH: 36 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 126 Ala Leu Arg Gly Arg GluLys Ala Val Asn Hi s Val Phe Leu Met Ile 1 5 10 15 Cys Val Met Met IleMet Cys Lys Ile Phe As p Ile Leu Tyr Ser Ser 20 25 30 Leu Glu Cys Phe 35<210> SEQ ID NO: 127 <211> LENGTH: 13 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 127 Asp Phe Phe Ile Phe Ile Phe Tyr PheLeu Le u Gly Ile 1 5 10 <210> SEQ ID NO: 128 <211> LENGTH: 7 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 128 Pro Val Tyr MetSer Gln Lys 1 5 <210> SEQ ID NO: 129 <211> LENGTH: 9 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 129 Asn Ile Arg Lys GlnLys Tyr Phe Ile 1 5 <210> SEQ ID NO: 130 <211> LENGTH: 14 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 130 Pro Leu Asn IleAsn Leu Ser Leu Phe Ile Il e Ile Phe Leu 1 5 10 <210> SEQ ID NO: 131<211> LENGTH: 10 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 131 His Thr Leu Phe Lys Lys Asn Leu Glu Ile 1 5 10 <210> SEQID NO: 132 <211> LENGTH: 8 <212> TYPE: PRT <213> ORGANISM: Arabidopsissp. <400> SEQUENCE: 132 Ile Val Lys Asn Ile Gly Phe Thr 1 5 <210> SEQ IDNO: 133 <211> LENGTH: 8 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 133 Met Arg Ile Ile Lys Phe Thr Asn 1 5 <210> SEQ ID NO:134 <211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 134 Pro Tyr Ile Tyr Phe 1 5 <210> SEQ ID NO: 135 <211>LENGTH: 14 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 135 Arg Phe Lys Leu Ile Leu Phe Leu Pro Tyr Me t His Asn Ile 15 10 <210> SEQ ID NO: 136 <211> LENGTH: 39 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 136 Leu Gly Met Asn Thr AsnIle Tyr Asn Asp Il e Asn Ile Ser Leu Thr 1 5 10 15 Gly His Ser Lys MetTyr Ile Leu Ile Tyr Gl n His Phe Phe Ile Gly 20 25 30 Leu Leu Asn GlnVal Val Thr 35 <210> SEQ ID NO: 137 <211> LENGTH: 35 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 137 Val Asn Ala Phe PhePhe Ile Ile Leu Tyr Me t Asn Leu Asn Leu Ser 1 5 10 15 Cys Gln Thr SerSer Lys Pro Asn Ile Tyr Il e His Ile Val Leu Tyr 20 25 30 Phe Glu Asn 35<210> SEQ ID NO: 138 <211> LENGTH: 11 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 138 Asn Phe Leu Lys Phe Pro Ile Leu PheSer Ph e 1 5 10 <210> SEQ ID NO: 139 <211> LENGTH: 55 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 139 Ser Lys Gln Val GlnIle Arg Phe Phe Gln Il e Ile Ile Phe Leu Asn 1 5 10 15 Lys Val Phe TyrLys Lys Lys Ser Thr Ser Ty r Leu Lys Asn Pro Leu 20 25 30 His Tyr ProPhe His Gln His Gln Arg Arg Ar g Glu Lys Lys Lys Arg 35 40 45 Arg ValVal Asn Gly Glu Gly 50 55 <210> SEQ ID NO: 140 <211> LENGTH: 6 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 140 Phe HisSer Lys His Ile 1 5 <210> SEQ ID NO: 141 <211> LENGTH: 15 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 141 Val Met Lys SerIle Tyr Phe Asn Cys Val Ph e Met Ile Asp Gln 1 5 10 15 <210> SEQ ID NO:142 <211> LENGTH: 19 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 142 His Leu Gly Leu Asn Phe Leu Val Ile Tyr Ty r Val IleArg Pro Met 1 5 10 15 His Asp Pro <210> SEQ ID NO: 143 <211> LENGTH: 4<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 143 AsnPhe Tyr Phe 1 <210> SEQ ID NO: 144 <211> LENGTH: 6 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 144 Ile Cys Leu Gly Lys Pro 15 <210> SEQ ID NO: 145 <211> LENGTH: 107 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 145 Gly Phe Ala Thr Arg Thr Lys Ser AspLys Ar g Ala Asn Arg Lys Gly 1 5 10 15 Glu Ile Ser Ala Tyr Gln Gly LysArg His Le u Val Ala Leu Ile Phe 20 25 30 Tyr Ser Leu Leu Tyr Val PheLeu Lys Ile Ly s Glu Arg Arg Gly Leu 35 40 45 Asn Leu Ile Thr Ile ArgPhe Gln Arg Asp Va l Lys Ile His Leu Ile 50 55 60 Asn Ser Tyr Thr LeuVal Ile Ile Phe Lys Th r Lys Lys Arg Asn Phe 65 70 75 80 Gln Thr Phe LysLeu Lys Thr Glu Phe Arg Ly s Cys Gln Arg Ile Asp 85 90 95 Asn Asp IleGln Ile Cys Arg Val Ser Lys Th r 100 105 <210> SEQ ID NO: 146 <211>LENGTH: 10 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 146 Asn Lys Lys Ile Ile Asn Ile Phe Ile Ile 1 5 10 <210> SEQID NO: 147 <211> LENGTH: 30 <212> TYPE: PRT <213> ORGANISM: Arabidopsissp. <400> SEQUENCE: 147 Ser Trp Asn Leu Gly Tyr Lys Ile Lys Leu Ly s IleIle Val Asp Phe 1 5 10 15 Phe Val Phe Val Lys Gln Asn Ser Asn Thr Il eCys Phe Phe 20 25 30 <210> SEQ ID NO: 148 <211> LENGTH: 5 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 148 Tyr Lys Glu ThrLys 1 5 <210> SEQ ID NO: 149 <211> LENGTH: 15 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 149 Val Gln Ile Val Phe PheLeu Thr Phe Ser Gl n Lys Ser Gln Asp 1 5 10 15 <210> SEQ ID NO: 150<211> LENGTH: 38 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 150 Cys Ile Tyr Gln Glu Ile Glu Ile Lys Thr Ph e Val Phe LysTyr Ser 1 5 10 15 Ser Phe Thr Ile Tyr Arg Val Gln Phe Leu Ly s Phe LysLys Ser Phe 20 25 30 Thr Tyr Ile Leu Leu Asp 35 <210> SEQ ID NO: 151<211> LENGTH: 147 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 151 Gln Arg Lys Phe Glu Leu Arg Tyr Ile Pro Se r Val Ala ThrHis Ala 1 5 10 15 Ser His His Gln Ser Phe Asp Leu Asn Gln Pr o Ala AlaGlu Asp Asp 20 25 30 Asn Gly Gly Asp Asn Lys Ser Leu Leu Ser Ar g MetGln Asn Pro Leu 35 40 45 Arg His Phe Ser Ala Ser Ser Asp Tyr Asn Se rTyr Glu Asp Gln Gly 50 55 60 Tyr Val Leu Asp Glu Asp Gln Asp Tyr Ala Leu Glu Glu Asp Val Pro 65 70 75 80 Leu Phe Leu Asp Glu Asp Val Pro LeuLeu Pr o Ser Val Lys Leu Pro 85 90 95 Ile Val Glu Lys Leu Pro Arg SerIle Thr Tr p Val Phe Thr Lys Arg 100 105 110 His Val Cys Phe Leu Phe ArgThr Ser Phe Ly s Ile Leu Ile Ile Tyr 115 120 125 Tyr Ile Val Ile Thr HisSer Ala Tyr Ile Hi s Phe Phe Asn Ile Ala 130 135 140 Val Ala Ser 145<210> SEQ ID NO: 152 <211> LENGTH: 6 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 152 Trp Leu Lys Val Ile Leu 1 5 <210>SEQ ID NO: 153 <211> LENGTH: 8 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 153 Leu Val Arg Asp Lys Ser Ile Ile 1 5<210> SEQ ID NO: 154 <211> LENGTH: 4 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 154 Met Val Arg His 1 <210> SEQ ID NO:155 <211> LENGTH: 26 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 155 Ala Val Lys Lys Met Arg Lys Met Lys Lys Ly s Met ArgLys Lys Ser 1 5 10 15 Arg Lys Lys Asn Ala Asn Phe Leu Lys Met 20 25<210> SEQ ID NO: 156 <211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 156 Thr Asp Leu Tyr Gly 1 5 <210> SEQ IDNO: 157 <211> LENGTH: 7 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 157 Phe Leu His Tyr Ile Cys Ser 1 5 <210> SEQ ID NO: 158<211> LENGTH: 25 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 158 Leu Leu Ile Cys Ser Pro Tyr Leu Ile Asn Cy s Ser Arg AsnPhe Gln 1 5 10 15 Asp Gly Trp Ala Gly Leu Trp Phe Gly 20 25 <210> SEQ IDNO: 159 <211> LENGTH: 32 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 159 Ser Gly Arg Ala Ala Cys Ser Arg Gln Val Pr o Arg SerGly Cys Phe 1 5 10 15 Gly His Ile Gly Asn Asn Ile Arg Ile Lys Th r SerTyr Val Asp Gln 20 25 30 <210> SEQ ID NO: 160 <211> LENGTH: 12 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 160 Leu SerCys Leu Phe Asn Phe Cys Cys Phe Se r Ser 1 5 10 <210> SEQ ID NO: 161<211> LENGTH: 10 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 161 Ile Phe Lys Ser Asn Val Gly Lys Ile Gln 1 5 10 <210> SEQID NO: 162 <211> LENGTH: 4 <212> TYPE: PRT <213> ORGANISM: Arabidopsissp. <400> SEQUENCE: 162 Trp Asn Cys Trp 1 <210> SEQ ID NO: 163 <211>LENGTH: 14 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 163 Phe Asp Ile Gln Asp Asn Asn Tyr Cys Phe Pr o Gly Phe Cys 15 10 <210> SEQ ID NO: 164 <211> LENGTH: 59 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 164 Thr Ser Leu Pro Ser LeuHis Gly Asn Phe Gl u Ser Phe Phe Phe Asn 1 5 10 15 Leu Ala Thr Lys LysGly Asp Asp His Thr Cy s Phe Tyr Phe Ile Leu 20 25 30 Ser Phe Val LeuGln Ile Phe Asp Cys His Me t His Glu Lys Tyr Glu 35 40 45 Pro Glu SerArg Ser Val Ser Ile Lys Phe Il e 50 55 <210> SEQ ID NO: 165 <211>LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 165 Ile Ile Leu Leu Val Ser Gln Pro Leu Tyr Il e Arg Leu SerAsp 1 5 10 15 <210> SEQ ID NO: 166 <211> LENGTH: 56 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 166 Ile Ala Leu Ala CysGln Ser Glu Asp Lys Se r Ser Leu Phe Glu Asp 1 5 10 15 Glu Asp Arg GlnPro Cys Ser Glu His Cys Ty r Leu Lys Val Ser Ile 20 25 30 Ser Leu ProLeu Ser Leu Asn Phe Phe Val Ty r Ser Leu Ile Thr Phe 35 40 45 Ile SerTyr Trp Phe Asn Ile Lys 50 55 <210> SEQ ID NO: 167 <211> LENGTH: 50<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 167 ValArg Ser Val Thr Glu Ala Asp His Val Me t Asp Asn Asp Asn Ser 1 5 10 15Ile Ser Asn Lys Ile Val Val Ser Asp Pro As n Asn Thr Met Trp Thr 20 2530 Pro Val Glu Lys Asp Leu Tyr Leu Lys Gly Il e Glu Ile Phe Gly Arg 3540 45 Asn Arg 50 <210> SEQ ID NO: 168 <211> LENGTH: 68 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 168 Lys Asn Lys Asn ArgPhe Asn Ala Leu Ile Ty r Ile Leu Thr Leu Tyr 1 5 10 15 Ser Leu Ile MetLeu Val Arg Ser Cys Asp Va l Ala Leu Asn Ile Leu 20 25 30 Arg Gly LeuLys Thr Cys Leu Glu Ile Tyr As n Tyr Met Arg Glu Gln 35 40 45 Asp GlnCys Thr Met Ser Leu Asp Leu Asn Ly s Thr Thr Gln Arg His 50 55 60 AsnGln Val His 65 <210> SEQ ID NO: 169 <211> LENGTH: 23 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 169 Lys His Met Lys PhePro Ile Cys Val Asp Gl y Phe Ile Thr Gly Tyr 1 5 10 15 Gln Lys Ser IleSer Lys Lys 20 <210> SEQ ID NO: 170 <211> LENGTH: 22 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 170 Val Gly Pro Gln LysIle Glu Thr Pro Lys Il e Cys Ser Leu Ser Ala 1 5 10 15 Cys Phe Lys GluAsn Asn 20 <210> SEQ ID NO: 171 <211> LENGTH: 41 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 171 Ala Leu His Thr Met HisLeu Gln Val Lys Me t Trp Thr Ala Met Pro 1 5 10 15 Leu Phe Asn Ser ArgLys Leu Leu Arg Glu Il e Leu Arg Val Cys His 20 25 30 Ser Ile Phe ProLys Pro Glu Asp Pro 35 40 <210> SEQ ID NO: 172 <211> LENGTH: 108 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 172 Val CysIle Phe Cys Ser Gly Ala Gln Arg Il e Ala Thr Ile Ala Leu 1 5 10 15 GluAsp Val Ile Val Gln Leu Ala Asn Ala Gl n Ile Asp Asn Val Leu 20 25 30Val Leu Leu Leu Ile Val Asn Ala Ile Gln Il e Phe Val Gly Val Val 35 4045 Leu Leu Gly Asn Thr Phe Thr Ser Ile Ser Le u Tyr Thr Asn Ser Ile 5055 60 Ile Lys Val Ile Gln Thr Lys Ser Leu Ile Ly s Lys Thr Leu Tyr Ile65 70 75 80 Ala Val Glu Met Ala Leu Leu Val Arg His Gl n Cys Lys Ser AsnAla 85 90 95 Arg Thr Cys Asn Ser Ser Phe Lys Pro Ile Ly s Arg 100 105<210> SEQ ID NO: 173 <211> LENGTH: 17 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 173 Ser Thr Ser Asn Pro Tyr Arg Lys PheLys Th r Asn Tyr Thr Lys Asp 1 5 10 15 Ile <210> SEQ ID NO: 174 <211>LENGTH: 7 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 174 Leu Ser Phe Pro Val Phe Tyr 1 5 <210> SEQ ID NO: 175 <211>LENGTH: 39 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 175 Ile Leu Ile Gly Lys Ser Asp Val His Gly Tr p Gly Ala PheThr Trp 1 5 10 15 Val Ser Asn His Val Asn Ile Arg Ile Ser Le u Ile ValIle Gly Ala 20 25 30 Phe Ile Thr Leu Phe Phe Phe 35 <210> SEQ ID NO: 176<211> LENGTH: 4 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 176 Cys Phe Ile Leu 1 <210> SEQ ID NO: 177 <211> LENGTH: 6<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 177 ThrIle Lys Tyr Ile Val 1 5 <210> SEQ ID NO: 178 <211> LENGTH: 53 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 178 Tyr GlyLeu Thr Arg Gln Asp Ser Leu Lys Ly s Asn Glu Tyr Leu Gly 1 5 10 15 GluTyr Thr Gly Glu Leu Ile Thr His Asp Gl u Ala Asn Glu Arg Gly 20 25 30Arg Ile Glu Asp Arg Ile Gly Ser Ser Tyr Le u Phe Thr Leu Asn Asp 35 4045 Gln Val Thr Ser Glu 50 <210> SEQ ID NO: 179 <211> LENGTH: 28 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 179 Ser AsnVal Leu Ile Ile Arg Gly Leu His Il e Tyr Ser Asn Gln Ser 1 5 10 15 AsnIle Tyr Phe Thr Ala Arg Asn Arg Cys Se r Pro 20 25 <210> SEQ ID NO: 180<211> LENGTH: 13 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 180 Arg Lys Arg Val Gln Ile Ser Gln Ser Leu Se r Lys Thr 1 510 <210> SEQ ID NO: 181 <211> LENGTH: 16 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 181 Leu Leu Arg Gln Gly Thr Lys Pro LeuTyr Ph e Ile Leu Asn Lys Tyr 1 5 10 15 <210> SEQ ID NO: 182 <211>LENGTH: 13 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 182 His Tyr Thr Asn Lys Asn Thr Tyr Val Ser Ph e Phe Ser 1 510 <210> SEQ ID NO: 183 <211> LENGTH: 24 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 183 Ile Val Tyr Gln Leu Tyr Ser Ser LeuIle Gl y Phe His Ile Glu Asp 1 5 10 15 Ile Pro Arg Asn Ser Asn Ser Phe20 <210> SEQ ID NO: 184 <211> LENGTH: 78 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 184 Met Ile Phe Ser Cys Arg Glu Asn LeuGly Ty r Glu Asn Leu Trp Phe 1 5 10 15 Arg Val Gln Leu Met Ile Val ArgGly Asp Gl n Arg Ile Gly Leu Phe 20 25 30 Ala Glu Arg Ala Ile Glu GluGly Glu Glu Le u Phe Phe Asp Tyr Cys 35 40 45 Tyr Gly Pro Glu His AlaAsp Trp Ser Arg Gl y Arg Glu Pro Arg Lys 50 55 60 Thr Gly Ala Ser LysArg Ser Lys Glu Ala Ar g Pro Ala Arg 65 70 75 <210> SEQ ID NO: 185 <211>LENGTH: 37 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 185 Gly Glu Ala Ala Ile Gln Ala Val Leu Phe Le u Cys Tyr GlyIle Ser 1 5 10 15 Ile Asn Asn Val Met Leu Phe Cys Val Thr Ly s Pro LysLeu Lys Phe 20 25 30 Leu Phe Tyr Leu Phe 35 <210> SEQ ID NO: 186 <211>LENGTH: 9 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 186 Gly Val Leu Phe Val Ser Tyr Val Ser 1 5 <210> SEQ ID NO:187 <211> LENGTH: 10 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 187 Leu Ser Lys Phe Ser Phe Cys Ile Ser Ile 1 5 10 <210>SEQ ID NO: 188 <211> LENGTH: 6 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 188 Lys Gln Cys Leu Cys Cys 1 5 <210>SEQ ID NO: 189 <211> LENGTH: 29 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 189 Thr Phe Gly Lys Lys Lys Leu Cys ThrThr Le u His Leu Phe Ser Leu 1 5 10 15 His Leu Ala Lys Asn His Ile ThrGln Val Cy s Gly Thr 20 25 <210> SEQ ID NO: 190 <211> LENGTH: 6 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 190 Cys ThrLys Met Ser Lys 1 5 <210> SEQ ID NO: 191 <211> LENGTH: 12 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 191 Trp Val Leu SerLeu Lys Lys Asn Ile Gly Ty r Glu 1 5 10 <210> SEQ ID NO: 192 <211>LENGTH: 19 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 192 Ser Ile Val Arg Ile Leu Gly Ile Ser Ser Ph e Gly Phe LysThr Phe 1 5 10 15 Phe Glu Ile <210> SEQ ID NO: 193 <211> LENGTH: 24<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 193 PheCys Ser Leu Leu Ser Asn Thr Trp Lys As n His Gln Gln Ser Gly 1 5 10 15Cys Ser Leu Arg Lys Val Leu Leu 20 <210> SEQ ID NO: 194 <211> LENGTH: 10<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 194 CysLys Tyr Val Phe Asp Ala Ser Asn Ile 1 5 10 <210> SEQ ID NO: 195 <211>LENGTH: 4 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 195 Tyr Leu Asn Lys 1 <210> SEQ ID NO: 196 <211> LENGTH: 13<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 196 LysGln Lys Lys Arg Lys Lys Leu Phe Lys Il e Arg Lys 1 5 10 <210> SEQ ID NO:197 <211> LENGTH: 24 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 197 Leu Phe Ser Lys Asn Leu Asn Tyr Lys Leu Ly s Cys LeuGlu Ser Arg 1 5 10 15 Thr Thr Ile Ala Lys Tyr Lys Cys 20 <210> SEQ IDNO: 198 <211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 198 Ile Tyr Met Lys Met 1 5 <210> SEQ ID NO: 199 <211>LENGTH: 13 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 199 Lys Thr Cys Trp Ile Cys Gly Ile Val Asn As p His Gly 1 510 <210> SEQ ID NO: 200 <211> LENGTH: 4 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 200 Met Ala Gly Ser 1 <210> SEQ ID NO:201 <211> LENGTH: 4 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 201 Ile His Tyr Phe 1 <210> SEQ ID NO: 202 <211> LENGTH:48 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 202Lys Ser Asn Phe Phe Ile Ser Ile Ile Cys Ph e Lys Glu Lys Lys Asn 1 5 1015 Thr Arg Arg Leu Ser Ile Cys Arg Leu Cys Se r Ser Val Asn Leu Tyr 2025 30 Phe Lys Thr Gly Gly Leu Phe Ile Thr Ile Se r Leu Asp Met Phe Leu35 40 45 <210> SEQ ID NO: 203 <211> LENGTH: 24 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 203 Cys Arg Pro Lys Asn ArgGlu Ile Arg Lys Gl y Thr Phe Val Val Ile 1 5 10 15 Val Thr Lys Gln LysSer Leu Tyr 20 <210> SEQ ID NO: 204 <211> LENGTH: 11 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 204 Ile Ile Arg Lys AspGlu Lys Ile Lys Pro Le u 1 5 10 <210> SEQ ID NO: 205 <211> LENGTH: 12<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 205 LeuAsp Asp His Arg Arg Gly Cys Gln Leu Gl n Ser 1 5 10 <210> SEQ ID NO: 206<211> LENGTH: 34 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 206 Cys Phe Tyr Ile Asp Leu Ser Tyr Ile Leu Cy s Ser Phe ThrPhe Lys 1 5 10 15 Lys Gln Tyr His Pro Ile Phe Phe Leu Leu Le u Ser ValSer Ile Phe 20 25 30 Ala Asn <210> SEQ ID NO: 207 <211> LENGTH: 21 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 207 Arg AsnThr Lys Glu His Lys Lys Gln Leu Va l Pro Asp Ser Thr Ile 1 5 10 15 SerAsn Asp Leu His 20 <210> SEQ ID NO: 208 <211> LENGTH: 106 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 208 Pro Pro Pro ProSer Ile Phe Pro Leu Ser Ph e Thr Ser Leu Ser Leu 1 5 10 15 Tyr Leu LeuAsn Ser Gly His Arg Leu Arg Ar g Phe Leu Cys Tyr Ser 20 25 30 Pro GlyArg Cys Arg Ser Leu Ile His Asp Le u Val Ser His His Arg 35 40 45 LeuHis Phe Asn Pro Gln Ser Leu Arg Lys Th r Arg Met Leu Cys Ser 50 55 60Pro Phe Pro Ser Leu His Leu Leu Asp Arg Se r Leu His Arg Pro Ser 65 7075 80 Leu Cys Leu Trp Asp Gln Lys Asn His Glu Hi s Asp His Val Tyr Lys85 90 95 Ser Arg Gln Lys Leu Val Ser Cys Asp Thr 100 105 <210> SEQ IDNO: 209 <211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 209 Lys Met Asp Val Gly 1 5 <210> SEQ ID NO: 210 <211>LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 210 Gly Phe Val Leu Phe Gly Ala Thr Arg Ser As p Ala Asp ValVal 1 5 10 15 <210> SEQ ID NO: 211 <211> LENGTH: 32 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 211 Gln His Tyr Ile TrpGly Leu Arg Gly Gly Gl u Val Val Arg Glu Ser 1 5 10 15 Arg Cys Gly SerAsp Leu Trp Tyr Asn Val Va l Val Glu Ala Lys Arg 20 25 30 <210> SEQ IDNO: 212 <211> LENGTH: 11 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 212 Gly Arg Lys Ser Cys Gly Gly Gly Tyr Gly Gl y 1 5 10<210> SEQ ID NO: 213 <211> LENGTH: 42 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 213 Pro Pro His Ser Phe Gly Gly Ser GlnPhe Cy s Glu Leu Val Tyr Val 1 5 10 15 Leu His Leu Cys Trp Asn Trp PheAsn Glu As p Leu Gln Arg Val Phe 20 25 30 Gly Phe Cys Glu Tyr Val AspPhe Glu His 35 40 <210> SEQ ID NO: 214 <211> LENGTH: 6 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 214 Glu Val Glu Lys ArgLeu 1 5 <210> SEQ ID NO: 215 <211> LENGTH: 4 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 215 Ile Met Cys Phe 1 <210>SEQ ID NO: 216 <211> LENGTH: 27 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 216 Ser Cys Val Arg Tyr Leu Thr Tyr TyrThr Hi s Leu Leu Asn Val Phe 1 5 10 15 Glu Ile Phe Leu Phe Leu Phe SerIle Ser Cy s 20 25 <210> SEQ ID NO: 217 <211> LENGTH: 25 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 217 Glu Phe Asn Pro TyrIle Cys His Lys Asn Se r Arg Ile Ser Glu Ser 1 5 10 15 Lys Asn Ile LeuSer Lys Asn Asn His 20 25 <210> SEQ ID NO: 218 <211> LENGTH: 16 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 218 Leu TyrPhe Tyr Asn Thr Pro Phe Leu Arg Ly s Thr Trp Arg Phe Asn 1 5 10 15 <210>SEQ ID NO: 219 <211> LENGTH: 12 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 219 Lys Ile Ser Asp Leu Arg Arg Ser PheLys Cy s Val 1 5 10 <210> SEQ ID NO: 220 <211> LENGTH: 6 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 220 Leu Asn Leu Arg IleGlu 1 5 <210> SEQ ID NO: 221 <211> LENGTH: 25 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 221 Tyr Ser His Ile Tyr IlePhe Glu Asp Leu As n Ser Phe Cys Phe Phe 1 5 10 15 His Ile Cys Ile IleTyr Lys Leu Lys 20 25 <210> SEQ ID NO: 222 <211> LENGTH: 9 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 222 Ile Leu Ile TyrIle Met Thr Leu Ile 1 5 <210> SEQ ID NO: 223 <211> LENGTH: 10 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 223 Val LeuPro Asp Thr Pro Lys Cys Ile Tyr 1 5 10 <210> SEQ ID NO: 224 <211>LENGTH: 9 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 224 Ser Ile Asn Ile Phe Ser Leu Val Tyr 1 5 <210> SEQ ID NO:225 <211> LENGTH: 14 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 225 Thr Lys Leu Ser His Lys Tyr Glu Leu Thr Pr o Phe PheLeu 1 5 10 <210> SEQ ID NO: 226 <211> LENGTH: 13 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 226 Ala Val Lys Arg Gln AlaAsn Pro Thr Ser Th r Tyr Ile 1 5 10 <210> SEQ ID NO: 227 <211> LENGTH:30 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 227Tyr Tyr Ile Leu Lys Ile Lys Ile Phe Leu As n Phe Pro Tyr Tyr Phe 1 5 1015 Pro Phe Lys Ala Ser Lys Ser Lys Tyr Val Se r Ser Arg Leu 20 25 30<210> SEQ ID NO: 228 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 228 Phe Ser Leu Ile Arg Phe Ser Thr LysLys As n Gln Leu Leu Ile 1 5 10 15 <210> SEQ ID NO: 229 <211> LENGTH: 39<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 229 LysThr Leu Cys Ile Ile Leu Phe Thr Asn Il e Arg Glu Asp Glu Lys 1 5 10 15Lys Arg Arg Gly Glu Trp Leu Met Glu Lys Va l Ser Phe Thr Pro Asn 20 2530 Ile Tyr Glu Leu Thr Arg Leu 35 <210> SEQ ID NO: 230 <211> LENGTH: 9<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 230 AsnPro Tyr Ile Leu Ile Val Cys Leu 1 5 <210> SEQ ID NO: 231 <211> LENGTH: 4<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 231 IleAsn Asn Ile 1 <210> SEQ ID NO: 232 <211> LENGTH: 19 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 232 Ser Ile Met Leu PheVal Pro Cys Met Ile Hi s Lys Thr Phe Ile Phe 1 5 10 15 Glu Phe Val <210>SEQ ID NO: 233 <211> LENGTH: 37 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 233 Glu Asn His Glu Asp Asp Gly Glu GlyLeu Pr o Pro Glu Leu Asn Gln 1 5 10 15 Ile Lys Glu Gln Ile Glu Lys GluArg Phe Le u His Ile Lys Val Arg 20 25 30 Asp Ile Trp Leu Leu 35 <210>SEQ ID NO: 234 <211> LENGTH: 9 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 234 Tyr Phe Ile Leu Phe Cys Met Phe Phe1 5 <210> SEQ ID NO: 235 <211> LENGTH: 10 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 235 Lys Leu Arg Arg Gly GluAsp Leu Ile Ser 1 5 10 <210> SEQ ID NO: 236 <211> LENGTH: 11 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 236 Leu Tyr Asp SerLys Glu Met Leu Arg Tyr Il e 1 5 10 <210> SEQ ID NO: 237 <211> LENGTH: 4<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 237 ThrVal Ile His 1 <210> SEQ ID NO: 238 <211> LENGTH: 13 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 238 Ser Leu Lys Leu LysArg Glu Ile Ser Lys Le u Leu Asn 1 5 10 <210> SEQ ID NO: 239 <211>LENGTH: 26 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 239 Lys Gln Asn Leu Glu Asn Ala Ser Glu Ser Il e Thr Thr SerArg Ser 1 5 10 15 Val Gly Tyr Pro Lys Leu Arg Ile Lys Lys 20 25 <210>SEQ ID NO: 240 <211> LENGTH: 4 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 240 Leu Ile Tyr Leu 1 <210> SEQ ID NO:241 <211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 241 Tyr Lys Ala Gly Thr 1 5 <210> SEQ ID NO: 242 <211>LENGTH: 29 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 242 Ile Phe Leu Phe Leu Ser Asn Lys Ile Val Il e Gln Phe ValPhe Phe 1 5 10 15 Ser Thr Lys Lys Leu Asn Arg Ser Lys Leu Ph e Phe Phe20 25 <210> SEQ ID NO: 243 <211> LENGTH: 34 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 243 His Ser Ala Lys Lys AlaLys Ile Asp Ala Ty r Ile Lys Lys Ser Lys 1 5 10 15 Ser Lys Leu Leu TyrSer Ser Ile Leu Val Se r Leu Tyr Ile Glu Ser 20 25 30 Ser Phe <210> SEQID NO: 244 <211> LENGTH: 18 <212> TYPE: PRT <213> ORGANISM: Arabidopsissp. <400> SEQUENCE: 244 Asn Leu Lys Asn His Leu Pro Ile Tyr Tyr Le u IleAsn Arg Glu Asn 1 5 10 15 Ser Ser <210> SEQ ID NO: 245 <211> LENGTH: 17<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 245 AspThr Phe Gln Val Trp Leu Leu Met Leu Hi s Thr Ile Asn Arg Leu 1 5 10 15Thr <210> SEQ ID NO: 246 <211> LENGTH: 100 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 246 Thr Ser Pro Leu Gln ArgMet Ile Met Glu Gl u Thr Thr Asn His Phe 1 5 10 15 Cys Arg Glu Cys LysThr His Phe Val Ile Se r Val Pro His Leu Ile 20 25 30 Ile Ile Leu ThrLys Ile Lys Val Met Phe Le u Met Arg Ile Lys Ile 35 40 45 Met Leu LeuLys Lys Met Tyr His Tyr Phe Le u Met Lys Met Tyr His 50 55 60 Tyr TyrGln Val Ser Ser Phe Gln Leu Leu Ar g Ser Tyr His Asp Pro 65 70 75 80 LeuHis Gly Ser Ser Pro Lys Gly Met Cys Va l Phe Cys Phe Val Leu 85 90 95Val Ser Lys Tyr 100 <210> SEQ ID NO: 247 <211> LENGTH: 4 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 247 Ser Tyr Thr Ile 1<210> SEQ ID NO: 248 <211> LENGTH: 13 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 248 Ser Leu Ile Val His Ile Tyr Ile SerLeu Th r Leu Gln 1 5 10 <210> SEQ ID NO: 249 <211> LENGTH: 4 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 249 Pro Ala Asp Gly1 <210> SEQ ID NO: 250 <211> LENGTH: 4 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 250 Phe Cys Asp Trp 1 <210> SEQ ID NO:251 <211> LENGTH: 8 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 251 Glu Thr Asn Leu Leu Phe Glu Trp 1 5 <210> SEQ ID NO:252 <211> LENGTH: 6 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 252 Gly Thr Arg Ile Glu Gln 1 5 <210> SEQ ID NO: 253<211> LENGTH: 11 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 253 Gly Arg Asn Gln Glu Arg Lys Met Arg Ile Ph e 1 5 10 <210>SEQ ID NO: 254 <211> LENGTH: 18 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 254 Arg Cys Arg Pro Ile Tyr Met Val SerPhe Cy s Ile Thr Tyr Val Leu 1 5 10 15 Asp Tyr <210> SEQ ID NO: 255<211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 255 Phe Val Val His Ile 1 5 <210> SEQ ID NO: 256 <211> LENGTH:41 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 256Thr Ala Gln Glu Ile Phe Arg Thr Val Gly Gl n Asp Tyr Gly Leu Asp 1 5 1015 Asp Leu Val Val Arg Arg Ala Leu Ala Lys Ty r Leu Glu Val Asp Val 2025 30 Ser Asp Ile Leu Val Thr Ile Phe Glu 35 40 <210> SEQ ID NO: 257<211> LENGTH: 30 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 257 Lys Leu His Thr Ser Ile Asn Asn Phe Pro Al a Tyr Leu IlePhe Val 1 5 10 15 Val Phe Arg Arg Glu Lys Cys Phe Lys Phe Se r Asn LeuMet 20 25 30 <210> SEQ ID NO: 258 <211> LENGTH: 51 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 258 Glu Arg Tyr Asn Glu LeuLys Leu Lys Asn As p Gly Thr Ala Gly Glu 1 5 10 15 Ala Ser Asp Leu ThrSer Lys Thr Ile Thr Th r Ala Phe Gln Asp Phe 20 25 30 Ala Asp Arg ArgHis Cys Arg Arg Cys Met Va l Thr Leu Asn Leu Ser 35 40 45 Phe Leu Ile 50<210> SEQ ID NO: 259 <211> LENGTH: 36 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 259 Pro Gln Lys Arg Glu Met Ile Ile HisVal Ph e Ile Leu Phe Tyr His 1 5 10 15 Leu Phe Tyr Arg Tyr Ser Ile ValIle Cys Me t Arg Ser Met Ser Pro 20 25 30 Ser Leu Asp Pro 35 <210> SEQID NO: 260 <211> LENGTH: 9 <212> TYPE: PRT <213> ORGANISM: Arabidopsissp. <400> SEQUENCE: 260 Ala Leu Asn Ser Phe Lys Leu Phe Cys 1 5 <210>SEQ ID NO: 261 <211> LENGTH: 6 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 261 Phe His Asn Pro Tyr Ile 1 5 <210>SEQ ID NO: 262 <211> LENGTH: 50 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 262 Val Ile Asn Leu Ile Arg Leu Leu TrpLeu Va l Arg Ala Lys Thr Asn 1 5 10 15 Leu Val Cys Leu Arg Met Lys IleAsp Asn Hi s Ala Val Ser Ile Val 20 25 30 Thr Ser Arg Ser Leu Ser LeuSer Leu Ser Le u Ser Ile Phe Leu Ser 35 40 45 Ile Pro 50 <210> SEQ IDNO: 263 <211> LENGTH: 12 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 263 Leu Arg Leu Leu Val Thr Gly Leu Ile Leu As n Arg 1 510 <210> SEQ ID NO: 264 <211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 264 Gln Lys Leu Ile Met 1 5 <210> SEQ IDNO: 265 <211> LENGTH: 23 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 265 Trp Ile Met Ile Thr Leu Tyr Gln Thr Arg Le u Trp SerGln Ile Gln 1 5 10 15 Thr Thr Leu Cys Gly Arg Leu 20 <210> SEQ ID NO:266 <211> LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 266 Arg Arg Ile Phe Thr 1 5 <210> SEQ ID NO: 267 <211>LENGTH: 19 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 267 Lys Glu Leu Arg Tyr Leu Gly Glu Thr Gly Ly s Lys Ile LysIle Asp 1 5 10 15 Leu Met His <210> SEQ ID NO: 268 <211> LENGTH: 8 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 268 Tyr IleTyr Leu His Cys Ile Pro 1 5 <210> SEQ ID NO: 269 <211> LENGTH: 10 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 269 Leu CysTrp Phe Ala Val Val Met Leu His 1 5 10 <210> SEQ ID NO: 270 <211>LENGTH: 9 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 270 Thr Tyr Phe Gly Gly Leu Arg Arg Ala 1 5 <210> SEQ ID NO:271 <211> LENGTH: 15 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 271 Arg Phe Thr Ile Thr Cys Ala Asn Lys Ile As n Val LeuCys His 1 5 10 15 <210> SEQ ID NO: 272 <211> LENGTH: 28 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 272 Thr Leu Thr Lys LeuHis Lys Asp Thr Ile Ar g Tyr Thr Asn Leu Cys 1 5 10 15 Arg Asn Tyr SerHis Asp Met Tyr Val Lys As n Thr 20 25 <210> SEQ ID NO: 273 <211>LENGTH: 95 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 273 Ser Phe Leu Tyr Val Leu Met Val Leu Ser Gl n Val Thr LysLys Val 1 5 10 15 Ser Arg Lys Ser Ser Arg Ser Val Arg Lys Ly s Ser ArgLeu Arg Lys 20 25 30 Tyr Ala Arg Tyr Pro Pro Ala Leu Lys Lys Th r ThrSer Gly Glu Ala 35 40 45 Lys Phe Tyr Lys His Tyr Thr Pro Cys Thr Cy sLys Ser Lys Cys Gly 50 55 60 Gln Gln Cys Pro Cys Leu Thr His Glu Asn Cys Cys Glu Lys Tyr Cys 65 70 75 80 Gly Tyr Val Ile Gln Phe Phe Leu SerArg Ly s Ile His Glu Ile 85 90 95 <210> SEQ ID NO: 274 <211> LENGTH: 22<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 274 PheGlu His Glu Phe Val Phe Phe Val Gln Va l Leu Lys Gly Leu Gln 1 5 10 15Gln Ser Leu Trp Arg Met 20 <210> SEQ ID NO: 275 <211> LENGTH: 16 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 275 Leu CysAsn Trp Pro Met His Lys Ser Thr Me t Ser Leu Phe Cys Cys 1 5 10 15 <210>SEQ ID NO: 276 <211> LENGTH: 11 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 276 Met Arg Ser Arg Ser Leu Ser Glu LeuSer Se r 1 5 10 <210> SEQ ID NO: 277 <211> LENGTH: 13 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 277 Val Thr Leu Ser LeuGln Tyr Leu Phe Ile Gl n Ile Leu 1 5 10 <210> SEQ ID NO: 278 <211>LENGTH: 6 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 278 Phe Lys Pro Lys Val Leu 1 5 <210> SEQ ID NO: 279 <211>LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 279 Lys Lys Leu Tyr Ile 1 5 <210> SEQ ID NO: 280 <211> LENGTH:7 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 280Leu Trp Arg Trp His Ser Trp 1 5 <210> SEQ ID NO: 281 <211> LENGTH: 17<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 281 AspThr Ser Ala Asn Pro Met Gln Glu His Al a Ile Pro Pro Ser Asn 1 5 10 15Gln <210> SEQ ID NO: 282 <211> LENGTH: 45 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 282 Lys Gly Asn Gln Arg GlnIle Arg Thr Glu As n Leu Lys Leu Ile Ile 1 5 10 15 Arg Lys Thr Phe AsnTyr His Phe Pro Tyr Ph e Thr Arg Phe Ser Leu 20 25 30 Glu Ser Leu MetPhe Met Asp Gly Val His Le u His Gly 35 40 45 <210> SEQ ID NO: 283 <211>LENGTH: 5 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 283 Leu Leu Val His Ser 1 5 <210> SEQ ID NO: 284 <211> LENGTH:21 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 284His Phe Phe Phe Phe Asn Asn Val Leu Tyr Ph e Arg Pro Leu Asn Ile 1 5 1015 Leu Cys Asp Met Val 20 <210> SEQ ID NO: 285 <211> LENGTH: 18 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 285 Pro ValArg Thr Leu Leu Lys Arg Met Ser Il e Ser Glu Asn Ile Leu 1 5 10 15 GluAsn <210> SEQ ID NO: 286 <211> LENGTH: 11 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 286 Ser Leu Met Met Lys LeuMet Ser Val Gly Gl u 1 5 10 <210> SEQ ID NO: 287 <211> LENGTH: 11 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 287 Lys IleGly Leu Val Leu Pro Thr Ser Leu Pr o 1 5 10 <210> SEQ ID NO: 288 <211>LENGTH: 9 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 288 Leu Gln Asn Asn Phe Glu Val Thr Phe 1 5 <210> SEQ ID NO:289 <211> LENGTH: 51 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp.<400> SEQUENCE: 289 Ser Phe Ala Gly Tyr Thr Ser Ile Arg Ile Ly s Val ThrPhe Ile Leu 1 5 10 15 Gln Leu Glu Ile Asp Ala Arg Arg Lys Gly As n GluPhe Lys Phe Leu 20 25 30 Asn His Ser Ala Arg Pro Asn Cys Tyr Ala Ly sVal Leu Ser Arg Tyr 35 40 45 Thr Leu Ser 50 <210> SEQ ID NO: 290 <211>LENGTH: 26 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 290 Thr Asn Thr Asn Ile Ile Gln Thr Lys Ile Le u Met Leu ValSer Leu 1 5 10 15 Val Lys Ser Cys Ile Asn Phe Thr Arg Arg 20 25 <210>SEQ ID NO: 291 <211> LENGTH: 16 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 291 Leu Val Phe Ile Leu Lys Ile Phe GlnGlu Th r Gln Thr His Phe Lys 1 5 10 15 <210> SEQ ID NO: 292 <211>LENGTH: 7 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 292 Phe Phe Leu Val Glu Lys Ile 1 5 <210> SEQ ID NO: 293 <211>LENGTH: 10 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 293 Val Thr Lys Ile Tyr Gly Phe Val Cys Ser 1 5 10 <210> SEQID NO: 294 <211> LENGTH: 57 <212> TYPE: PRT <213> ORGANISM: Arabidopsissp. <400> SEQUENCE: 294 Glu Glu Ile Arg Gly Leu Val Tyr Leu Arg Ar g GluGln Ser Lys Lys 1 5 10 15 Val Arg Ser Phe Ser Ser Thr Thr Ala Met As pGln Asn Met Arg Ile 20 25 30 Gly Arg Val Val Glu Asn Leu Glu Arg Leu Val Leu Leu Lys Gly Leu 35 40 45 Arg Lys Pro Val Gln Leu Val Ser Phe 50 55<210> SEQ ID NO: 295 <211> LENGTH: 21 <212> TYPE: PRT <213> ORGANISM:Arabidopsis sp. <400> SEQUENCE: 295 Ser Glu Glu Lys Gln Gln Phe Lys GlnSer Ph e Phe Tyr Val Met Val 1 5 10 15 Tyr Gln Leu Ile Met 20 <210> SEQID NO: 296 <211> LENGTH: 66 <212> TYPE: PRT <213> ORGANISM: Arabidopsissp. <400> SEQUENCE: 296 Cys Tyr Phe Val Leu Leu Asn Gln Asn Leu Se r PheCys Phe Ile Cys 1 5 10 15 Phe Arg Val Phe Cys Leu Tyr His Met Cys Le uAsn Phe Gln Ser Phe 20 25 30 Leu Phe Val Phe Gln Phe Lys Asn Asn Val Tyr Val Val Ser Leu His 35 40 45 Arg Pro Leu Glu Lys Lys Ser Phe Ala GlnLe u Tyr Ile Tyr Leu Val 50 55 60 Phe Ile 65 <210> SEQ ID NO: 297 <211>LENGTH: 4 <212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400>SEQUENCE: 297 Arg Lys Ile Thr 1 <210> SEQ ID NO: 298 <211> LENGTH: 18<212> TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 298 HisLys Ser Val Val Arg Asn Val Gln Lys Cy s Gln Asn Asn Gly Phe 1 5 10 15Tyr His <210> SEQ ID NO: 299 <211> LENGTH: 9 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 299 Lys Lys Ile Leu Val MetAsn Glu Val 1 5 <210> SEQ ID NO: 300 <211> LENGTH: 18 <212> TYPE: PRT<213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 300 Val Leu Ala Arg LeuVal Leu Lys Arg Phe Se r Arg Phe Asn Phe Val 1 5 10 15 Val Tyr <210> SEQID NO: 301 <211> LENGTH: 32 <212> TYPE: PRT <213> ORGANISM: Arabidopsissp. <400> SEQUENCE: 301 Val Ile His Gly Arg Ile Ile Asn Lys Val Al a ValAla Tyr Glu Arg 1 5 10 15 Phe Tyr Phe Asn Val Asn Met Tyr Leu Met Hi sLeu Thr Phe Ser Ile 20 25 30 <210> SEQ ID NO: 302 <211> LENGTH: 22 <212>TYPE: PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 302 Thr AsnLys Asn Lys Lys Lys Glu Lys Ser Se r Leu Lys Ser Glu Ser 1 5 10 15 AsnTyr Phe Gln Lys Ile 20 <210> SEQ ID NO: 303 <211> LENGTH: 21 <212> TYPE:PRT <213> ORGANISM: Arabidopsis sp. <400> SEQUENCE: 303 Ile Ile Asn LeuAsn Val Trp Asn Arg Glu Ar g Leu Leu Leu Asn Ile 1 5 10 15 Asn Ala LysTyr Thr 20 <210> SEQ ID NO: 304 <211> LENGTH: 20 <212> TYPE: PRT <213>ORGANISM: Arabidopsis sp. <400> SEQUENCE: 304 Arg Cys Glu Lys His ValGly Phe Val Glu Se r Leu Met Thr Thr Val 1 5 10 15 Lys Trp Arg Asp 20<210> SEQ ID NO: 305 <211> LENGTH: 23 <212> TYPE: DNA <213> ORGANISM:Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Descriptionof Artificial Sequence:primer Nir Cla-73 <400> SEQUENCE: 305 ggcggacatcaaacctactt agc 23 <210> SEQ ID NO: 306 <211> LENGTH: 24 <212> TYPE: DNA<213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHERINFORMATION: Description of Artificial Sequence:primer cer1n2 <400>SEQUENCE: 306 gtaacatta aggcctttcc tttt 24 <210> SEQ ID NO: 307 <211>LENGTH: 23 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220>FEATURE: <223> OTHER INFORMATION: Description of ArtificialSequence:primer Nir-C-2-S-N <400> SEQUENCE: 307 cggtcatcaa gtgagttatgaag 23 <210> SEQ ID NO: 308 <211> LENGTH: 19 <212> TYPE: DNA <213>ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Description of Artificial Sequence:primer cer1s659 <400> SEQUENCE: 308ggtccaatcg gcaatgagt 19 <210> SEQ ID NO: 309 <211> LENGTH: 22 <212>TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHERINFORMATION: Description of Artificial Sequence:primer cer1ns10596n<400> SEQUENCE: 309 gtccaatcgg caatgagtag ag 22 <210> SEQ ID NO: 310<211> LENGTH: 22 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence<220> FEATURE: <223> OTHER INFORMATION: Description of ArtificialSequence:primer Nir La-4Cla-S-S <400> SEQUENCE: 310 gtgtgcctaacagtttccgc ac 22 <210> SEQ ID NO: 311 <211> LENGTH: 24 <212> TYPE: DNA<213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHERINFORMATION: Description of Artificial Sequence:primer cer1ns10265n<400> SEQUENCE: 311 tctcggagat ggtgccatat cagc 24 <210> SEQ ID NO: 312<211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence<220> FEATURE: <223> OTHER INFORMATION: Description of ArtificialSequence:primer fie3cds5′.seq <400> SEQUENCE: 312 atgtcctctg gagagcagaaggaagagtcg ttttacacgg 40 <210> SEQ ID NO: 313 <211> LENGTH: 24 <212>TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHERINFORMATION: Description of Artificial Sequence:primer cer1ns10129n<400> SEQUENCE: 313 tctggagagc agaaggaaga gtcg 24 <210> SEQ ID NO: 314<211> LENGTH: 22 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence<220> FEATURE: <223> OTHER INFORMATION: Description of ArtificialSequence:primer cer1ns10030n <400> SEQUENCE: 314 cgagtcattg acgtcaacagtg 22 <210> SEQ ID NO: 315 <211> LENGTH: 24 <212> TYPE: DNA <213>ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Description of Artificial Sequence:primer cer1ns9922n <400> SEQUENCE:315 ctcgcaaatg tgcagagtct tgtg 24 <210> SEQ ID NO: 316 <211> LENGTH: 20<212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223>OTHER INFORMATION: Description of Artificial Sequence:primer cer1n1570<400> SEQUENCE: 316 aggtcatcgc tatgaagttc 20 <210> SEQ ID NO: 317 <211>LENGTH: 20 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220>FEATURE: <223> OTHER INFORMATION: Description of ArtificialSequence:primer cer1ns98f9511n <400> SEQUENCE: 317 gctagttgtg gtatggacac20 <210> SEQ ID NO: 318 <211> LENGTH: 21 <212> TYPE: DNA <213> ORGANISM:Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Descriptionof Artificial Sequence:primer cer1ns98f9311s <400> SEQUENCE: 318cacatggact gatgatccat c 21 <210> SEQ ID NO: 319 <211> LENGTH: 20 <212>TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHERINFORMATION: Description of Artificial Sequence:primer cer1s2099 <400>SEQUENCE: 319 gtaaccgttg gtttggtgat 20 <210> SEQ ID NO: 320 <211>LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220>FEATURE: <223> OTHER INFORMATION: Description of ArtificialSequence:primer Nir E-4-N-N <400> SEQUENCE: 320 ggttagtaag tcaatgatggttaag 25 <210> SEQ ID NO: 321 <211> LENGTH: 19 <212> TYPE: DNA <213>ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Description of Artificial Sequence:primer cer1ns8795n <400> SEQUENCE:321 gcgataggta atcagagag 19 <210> SEQ ID NO: 322 <211> LENGTH: 20 <212>TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHERINFORMATION: Description of Artificial Sequence:primer cer1ns8517n <400>SEQUENCE: 322 ctgtaatcag gcaaacagcc 20 <210> SEQ ID NO: 323 <211>LENGTH: 20 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220>FEATURE: <223> OTHER INFORMATION: Description of ArtificialSequence:primer cer1ns98f8483s <400> SEQUENCE: 323 cagccatgtc tgtcgatgga20 <210> SEQ ID NO: 324 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM:Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Descriptionof Artificial Sequence:primer fie3cds3′.seq <400> SEQUENCE: 324atccatcttc tctctcacca atgcagtgaa aatttcttaa 40

What is claimed is:
 1. An isolated double stranded nucleic acid moleculecomprising a polynucleotide that: a. specifically hybridizes to SEQ IDNO:3 in a buffer of 40% formamide, 1M NaCl, 1% SDS at 37° C., followedby one wash for 20 minutes in 0.2×SSC at a temperature of about 50° C.;and b. enhances endosperm development in the absence of fertilizationwhen the polynucleotide is operably linked to a plant promoter toinhibit gene expression and introduced into a plant.
 2. The isolateddouble stranded nucleic acid molecule of claim 1, wherein thepolynucleotide is at least 100 nucleotides in length.
 3. The isolateddouble stranded nucleic acid molecule of claim 1, wherein thepolynucleotide comprises SEQ ID NO:3.
 4. The isolated double strandednucleic acid molecule of claim 1, wherein the polynucleotide encodes aFIE polypeptide.
 5. The isolated double stranded nucleic acid moleculeof claim 4, wherein the FIE polypeptide comprises SEQ ID NO:4.
 6. Theisolated double stranded nucleic acid molecule of claim 1, wherein thepolynucleotide is operably linked to a plant promoter.
 7. The isolateddouble stranded nucleic acid molecule of claim 6, wherein the plantpromoter is tissue-specific.
 8. The isolated double stranded nucleicacid molecule of claim 7, wherein the plant promoter is ovule- orembryo-specific.
 9. The isolated double stranded nucleic acid moleculeof claim 6, wherein the plant promoter is from a FIE3 gene.
 10. Theisolated double stranded nucleic acid molecule of claim 6, wherein thepolynucleotide is operably linked to the plant promoter in an antisenseorientation.
 11. The isolated double stranded nucleic acid molecule ofclaim 6, wherein the polynucleotide is operably linked to the plantpromoter in a sense orientation.
 12. A transgenic plant comprising anexpression cassette comprising a plant promoter operably linked to thepolynucleotide of claim
 1. 13. The transgenic plant of claim 12, whereinthe polynucleotide is at least 100 nucleotides in length.
 14. Thetransgenic plant of claim 12, wherein the polynucleotide comprises SEQID NO:3.
 15. The transgenic plant of claim 12, wherein thepolynucleotide encodes a FIE polypeptide.
 16. The transgenic plant ofclaim 15, wherein the FIE polypeptide comprises SEQ ID NO:4.
 17. Thetransgenic plant of claim 12, wherein the polynucleotide is operablylinked to a plant promoter.
 18. The transgenic plant of claim 17,wherein the plant promoter is tissue-specific.
 19. The transgenic plantof claim 17, wherein the plant promoter is ovule- or embryo-specific.20. The transgenic plant of claim 17, wherein the plant promoter is froma FIE3 gene.
 21. The transgenic plant of claim 17, wherein thepolynucleotide is operably linked to the plant promoter in an antisenseorientation.
 22. The transgenic plant of claim 17, wherein thepolynucleotide is operably linked to the plant promoter in a senseorientation.